CELL-BASED ASSAY

Technical Field of the Invention 

This invention relates to materials and methods for the identification of inhibitors of certain
molecular interactions.

Background of the Invention 

Cellular signal transduction, i.e., the series of events leading from extracellular events to
intracellular sequelae, is an aspect of cellular function in both normal and disease states. Numerous
proteins that function as signal transducing molecules have been identified, including receptor and
nonreceptor tyrosine kinases, phosphatases and other molecules with enzymatic or regulatory
activities. These molecules generally demonstrate the capacity to associate specifically with other
proteins to form a signaling complex that can alter cell activity.

Signaling proteins often contain domains of conserved sequence which serve as non-catalytic
modules that direct protein-protein interactions during signal transduction. One such domain is the
Src homology domain 2 (SH2), which is found in various combinations and locations in different
proteins. For example, some members of the Src-family of tyrosine kinases, e.g., Abl, GRB2 and
PI3K, each contain one SH2 domain. Other tyrosine kinases, such as ZAP and Syk, contain two SH2
domains. The presence of multiple SH2 domains within a protein increases the variety of potential
protein-protein interactions.

SH2 domains direct the association of specific proteins or protein domains by binding
selectively and with specificity to protein sequences or motifs containing phosphotyrosine. For
example, upon ligand binding, the PDGF beta-receptor dimerizes and autophosphorylates multiple
tyrosine residues. This phosphorylation of tyrosine-containing protein motifs within the receptor
triggers its physical association with SH2-containing proteins such as c-src, PLC-gamma, PI3K
and ras-GAP, forming a signaling complex. Other examples include the binding of the tandem
SH2-containing protein Syk to tyrosine-phosphorylated motifs on the beta or gamma subunits of the
IgE receptor, and the binding of the tandem SH2-containing protein ZAP-70 with
tyrosine-phosphorylated motifs on subunits of the T cell receptor. Other protein receptors for
tyrosine-phosphorylated protein domains are known and include the so-called phosphotyrosine binding
domains ("PTBs") or phosphotyrosine interaction domains ("PIDs"). Many signaling pathways
which play critical roles in disease processes are mediated by the binding of a
phosphotyrosine-containing protein or protein domain with an SH2 or other protein receptor for a
tyrosine-phosphorylated domain.

Pharmaceutical agents which interfere with the formation or stability of such signaling
complexes may be used for precise intervention in these complex biological processes in order to
treat or prevent the diseases or pathological effects mediated by such signaling. However, one
common but significant problem when screening for pharmaceutically useful compounds that
disrupt specific cellular events is deciphering their mechanism and specificity of action. Typically
one searches for inhibition of a signaling cascade by measuring the effect of test compounds on an
event several, if not many, steps downstream of a protein-protein interaction of particular
interest. To the extent that a compound may exert its effects on any number of intermediate steps,
the results of such experiments can be difficult to interpret.

A so-called "two-hybrid" interaction assay described by Song and Fields, Nature, 340:245-
247 (1989) has been used to detect the interactions of a variety of molecules. See also Fields et
al, US Patent No. 5,283,173 (1 Feb 1994). The two-hybrid assay is based on the observation that
transcription factors contain separable functional modules that direct either DNA binding or
transcription activation. A DNA binding domain expressed in cells will bind to DNA but not activate
transcription as it lacks a transcription activation domain. Conversely, a transcription activation
domain alone will not affect transcription in the absence of directed and/or intimate interaction
with DNA such as would be provided by a DNA-binding domain. However, if the DNA binding domain
and transcription activation domains are each expressed as part of separate fusion proteins, and
the fusion proteins are capable of associating, the "two-hybrid" complex so formed represents a
reconstituted transcription factor (see FIG. 1). Such a reconstituted transcription factor is capable
of activating transcription of a reporter gene (e.g., a gene for a conveniently detectable marker
such as beta-galactosidase or alkaline phosphatase (SEAP) or a protein important for cell
viability) located downstream of DNA binding sites recognized by the DNA-binding domain. The
level of reporter gene expression, i.e., the amount of gene product produced, will reflect the
extent to which the fusion proteins complex with one another. Compounds that block the association
[...] SH2-dependent protein-[...] involving phosphorylated ligands and their receptors [Chaudhuri, B.
et al, FEBS Lett., 357:221-226 (1995)]. While a yeast-based two-hybrid approach may be
useful for other purposes, it would not lend itself to screening for drugs to inhibit
phosphopeptide-mediated interactions in mammalian cells, since the use of yeast cells would limit the
researcher, at least [...], to the subset of compounds which are able to penetrate the yeast cell wall.
This artificial limitation would incorporate into the experimental design the risk of missing
important new compounds capable of blocking key signaling interactions in mammalian cells, but
[...]. There thus remains a need in the art for improved methods and compositions for identifying
inhibitors of phosphopeptide-mediated protein-protein binding in a wide variety of host cells.

The present invention addresses this need by providing a mechanism-based in vivo assay that
allows the identification of inhibitory compounds that target key molecules involved in specific
signal transduction pathways.

The invention provides [...] recombinant DNA [...] fusion proteins. One DNA molecule encodes a
transcription activating fusion protein which [...] binding pair, i.e., a phosphopeptide binding
domain or peptide ligand therefor. Another DNA molecule encodes a DNA-binding fusion protein which
comprises one [...]. The cell further contains a reporter gene which is linked to a DNA sequence to
which the DNA-binding fusion protein is capable of binding. The reporter gene encodes a detectable
gene product which is expressed when transcription of the reporter gene is activated, e.g. following
association of the two fusion proteins to reconstitute an effective transcription factor. The
engineered cell also contains a protein kinase capable of phosphorylating one or more of the
tyrosine residues of the peptide ligand, as is required for two-hybrid formation. The kinase may be
endogenous to the cell, or may be the gene product of an introduced, heterologous gene. In one
embodiment one of the fusion proteins contains a kinase domain and thus provides its own kinase
function. One or both of the fusion proteins may optionally contain a nuclear hormone-binding
domain (HBD) permitting [...]. In such embodiments, addition of the hormone renders the
HBD-bearing fusion protein available for two-hybrid formation. When the cell expresses the two
fusion proteins, and the peptide ligand domain has become tyrosine phosphorylated, the two fusion
proteins are capable of binding to each other via complexation of the phosphorylated peptide ligand
of one fusion protein with the PBD of the other. So complexed, the fusion proteins are capable of
detectably activating the transcription of the reporter gene linked to a DNA sequence recognized
(i.e., bound) by the DNA-binding domain of one of the fusion proteins.

One object of the invention is to provide the above-described DNA molecules. Another is to
introduce the DNA molecules into host cells to provide genetically engineered cells useful for
conducting the assay methods described herein.
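
The conditions under which the engineered cell described above expresses its reporter gene can be summarized as a small truth-table model. The Python sketch below is illustrative only: the class `CellState`, its field names and the function `reporter_expressed` are hypothetical names introduced here, and the model treats every condition as all-or-nothing, which a real assay would not.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CellState:
    """Hypothetical, simplified description of one engineered-cell culture."""
    kinase_active: bool      # a kinase can tyrosine-phosphorylate the peptide ligand
    hormone_present: bool    # hormone added to the medium
    hbd_regulated: bool      # True if a fusion protein carries a hormone-binding domain
    inhibitor_present: bool  # a phosphopeptide binding pair inhibitor is present


def reporter_expressed(state: CellState) -> bool:
    """Return True if the two-hybrid complex can form and drive the reporter."""
    ligand_phosphorylated = state.kinase_active
    # An HBD-bearing fusion protein is only available once hormone is added.
    fusion_available = state.hormone_present or not state.hbd_regulated
    return ligand_phosphorylated and fusion_available and not state.inhibitor_present
```

In this simplified picture, a hormone-regulated cell without hormone and a cell exposed to an inhibitor both give no reporter signal, so the two cases have to be told apart by the experimental design rather than by the read-out alone.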

Another object of the invention is to provide a method for identifying the presence in a test
composition of an inhibitor of the binding of a tyrosine-phosphorylated ligand with a
phosphopeptide binding domain therefor or of a biological activity mediated by that phosphopeptide
binding pair. This method involves culturing or maintaining the above-described genetically
engineered cells in a medium suitable for cell growth, in the presence and absence of the test
composition, and determining whether production of the detectable reporter gene product was
diminished in the presence of the test composition. In practice, the test composition is added to the
cells, e.g. to the medium in which the cells are cultured, and the culture is incubated under
conditions permitting formation of a complex between the fusion proteins. In embodiments
involving hormone regulation, the hormone may be added before, during or, as may often be
preferable, after addition of the test composition. If binding of the DNA-binding fusion protein to
the transcription activating fusion protein occurs to a lesser extent in the presence of the test
composition than in its absence, i.e., if the presence or increased concentration of the test
composition reduces the concentration of two-hybrid complex, then the test composition is or
contains a phosphopeptide binding pair inhibitor. The presence or absence of two-hybrid complex
may be monitored by measuring the level of expression of the reporter gene.
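
The comparison of reporter expression in the presence and absence of the test composition can be expressed as a percent inhibition. The following Python sketch assumes the read-out has already been reduced to one number per culture (for example, reporter activity units); the function name `percent_inhibition` and the optional `background` parameter are assumptions made for illustration and are not part of the described method.

```python
def percent_inhibition(signal_with: float, signal_without: float,
                       background: float = 0.0) -> float:
    """Percent reduction of reporter signal caused by the test composition.

    signal_with    -- reporter signal in the presence of the test composition
    signal_without -- reporter signal in its absence
    background     -- signal from cells lacking a functional two-hybrid pair
                      (hypothetical optional correction)
    """
    window = signal_without - background
    if window <= 0:
        raise ValueError("uninhibited signal must exceed background")
    return 100.0 * (1.0 - (signal_with - background) / window)
```

A value near zero indicates no effect on two-hybrid formation; a value approaching 100 indicates that reporter expression has fallen to background.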

To recap, an illustrative in vivo assay format relies upon genetically engineered cells capable
of expressing a reporter gene under PBP-mediated transcriptional control. These cells contain a
phosphopeptide binding domain of interest and a corresponding peptide ligand, each in the form of a
distinct fusion protein. Each such fusion protein thus comprises, among other component regions,
at least one PBD or peptide ligand sequence. The two fusion proteins are designed to be capable of
forming a complex with each other through binding of the tyrosine-phosphorylated ligand domain of
one fusion protein to the PBD of the other fusion protein. In the presence of a kinase activity
capable of tyrosine-phosphorylating the ligand domain, as discussed elsewhere, and, in
hormone-dependent embodiments, in the presence of the necessary hormone, the cells express the
reporter gene unless an inhibitory substance is present which interferes with the formation or
persistence of the two-hybrid complex required for transcription of the reporter gene. In this assay,
the cells are cultured or maintained in a suitable culture medium to establish a base-line for
expression of the reporter gene. The test composition is added to the culture medium and the ability
of the test composition to inhibit expression of the reporter gene is measured. A series of
experiments with different concentrations of the test compositions can be conducted in parallel. If
the level of reporter gene expression is reduced in the presence of the test composition, the test
composition is an inhibitor. If the structure of the inhibitor so identified is not yet known, the
compound may then be isolated from the other assay components and characterized. It may be
re-evaluated, if desired, using engineered cells expressing analogous fusion proteins but bearing
different PBDs and/or peptide ligand domains. This provides a means to confirm the selectivity of
the interaction of the inhibitor with the PBP with which it was identified. If desired, the binding
affinity of the inhibitor for the PBD or peptide ligand domain with which it was identified may be
determined, e.g. through the use of BIAcore® technology. The inhibitor so identified may be assayed
in an in vitro binding assay as described above and may further be evaluated for pharmacological
and/or toxicological activity in various in vitro and/or in vivo assays, as desired.
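
Where a series of concentrations is run in parallel as described above, the results can be condensed into a single potency estimate. The Python sketch below is one possible approach and is not taken from the specification: it assumes responses are expressed as a fraction of the uninhibited reporter signal, and it estimates the half-maximal concentration by linear interpolation on a log-concentration scale. The function name `estimate_ic50` is hypothetical.

```python
import math
from typing import Optional, Sequence


def estimate_ic50(concentrations: Sequence[float],
                  responses: Sequence[float]) -> Optional[float]:
    """Interpolate the concentration giving 50% of the uninhibited signal.

    concentrations -- positive test-composition concentrations (any one unit)
    responses      -- reporter signal as a fraction of the uninhibited control
    Returns None if no adjacent pair of points brackets the 50% level.
    """
    points = sorted(zip(concentrations, responses))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo >= 0.5 >= r_hi and r_lo != r_hi:
            # Fraction of the way from the lower to the higher concentration.
            frac = (r_lo - 0.5) / (r_lo - r_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_ic50
    return None
```

Running the same calculation on data from cells bearing a different PBD or peptide ligand domain would give a second estimate, and the ratio of the two is one simple way to express the selectivity discussed above.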
A wide variety of test compositions can be assayed in accordance with the invention,
including, e.g., microbial broths, cellular extracts, conditioned media from cell lines or from host
cells transformed with genetic libraries, collections of synthetic compounds, combinatorial
libraries or other compounds or mixtures from synthetic programs based on conventional
medicinal chemistry approaches or structure-based drug design.

This invention thus provides a means for identifying selective inhibitors of phosphopeptide
binding [...]. As noted at the outset, phosphopeptide binding domains are present in a wide
variety of proteins, including proteins involved in intracellular signal transduction pathways
relevant to a number of important normal and disease processes. Accordingly, inhibitors identified
by this invention may be useful for a variety of purposes. First, they may be used as
biological reagents in assays as described herein for functional classification of a phosphopeptide
ligand or phosphopeptide receptor of a particular protein, [...]
protein. Families or classes of such proteins may thus be defined functionally, with respect to
binding [...].

Moreover, inhibitory agents of this invention can be used to inhibit the occurrence of
biological events resulting from molecular interactions mediated by a phosphopeptide binding
domain. This invention thus provides a method and reagents for inhibiting (totally or partially) the
interaction between a protein containing a phosphopeptide binding domain and a natural ligand
thereto (i.e., a protein which normally binds in a cell to the SH2, PID or other phosphopeptide
binding protein) or a biological activity mediated by such interaction. In a research context such
inhibitors can be used to study the biological role of the signaling event and the cell and molecular
biology of the affected signal transduction pathway.

An inhibitor identified by the method of this invention can be formulated into a
pharmaceutical composition containing a pharmaceutically acceptable carrier and/or other
excipient(s) using conventional materials and means. Such a composition can be administered to an
animal, either human or non-human, for prevention or treatment of a disease or condition
resulting from cellular events involving a phosphopeptide/PBD-mediated protein-protein
interaction. Administration of such composition may be by any conventional route (parenteral,
oral, inhalation, and the like) using appropriate formulations as are well known in this art.
The inhibitor of this invention can be employed in admixture with [...]
pharmaceutically acceptable organic or inorganic carrier substances suitable for parenteral
[...].

Inhibitors first identified by these methods, [...]
containing such inhibitors, and pharmaceutical methods involving administration of such
[...] for the treatment or prevention of diseases mediated by the binding
[...].

Other aspects and advantages of the present invention are described further in the following
detailed description of the preferred embodiments thereof.

Brief Description of the Drawings

FIG. 1 depicts various assay configurations of this invention.

FIG. 2 is a bar graph depicting the results of a zeta (ζ) ITAM-tandem ZAP SH2-dependent
two-hybrid formation in mammalian cells. Secreted alkaline phosphatase (SEAP) activity (RU) is
plotted against cells which received the following DNAs, which correspond to the columns in the
graph: (1) carrier DNA, (2) pMSZ (Example 1D), (3) pMSZ and pMAZ22 (Example 1E), (4)
pMS (GAL4 DNA binding domain-vSrc kinase), and (5) pMS and pMAZ22.

FIG. 3 [...] regulated assay system. The bar graph depicts the results [...]
which received plasmid DNAs in duplicate, i.e., cols. 1 and 2 are carrier DNA; cols. 3 and 4 are
pMSZ; cols. 5 and 6 are pMSZ and pMAZ22; cols. 7 and 8 are pMerSZ (Example 1F); and cols. 9 and
10 are pMerSZ and pMAZ22.

FIG. 4 is a bar graph of a ZAP two-hybrid assay plotting SEAP activity units vs. four stable
clones A through D, each grown with (dark bar) or without (light bar) inducer (estrogen).

FIG. 5 is a bar graph plotting the time course of SEAP production (RU) in Clone D with
inducible Zeta ITAM-tandem ZAP SH2-dependent two-hybrid activity vs. time following estrogen
addition to media from 0 minutes to 24 hours. The graph also illustrates that parent cells
(unengineered G5IL-2 HT1080 cells) do not produce SEAP when cultured in the presence (+E) or
absence (-) of estrogen.

FIG. 6 is a bar graph plotting SEAP activity (RU) vs. specific Beta ITAM-Src SH2-dependent
two-hybrid formation in mammalian cells transfected with plasmid DNAs, which correspond to the
columns in the graph: (1) pMSB (Example 1K); (2) pMSB and pMAS2 (Example 1N); or (3)
pMSB and pMAZ22.

FIG. 7 is a bar graph plotting SEAP activity (RU) vs. estrogen regulation of Beta ITAM-Src
SH2-dependent two-hybrid formation in mammalian cells. The presence (+) or absence (-) of
estrogen is indicated. The columns represent cells transfected in duplicate with pMerSB
(Example 1L) [Cols. 1 and 2] or with pMerSB and pMAS2 (Example 1N) [Cols. 3 and 4].

FIG. 8 is a bar graph plotting SEAP activity (RU) of a stable mammalian cell line containing
an inducible GAL4-estrogen receptor l.b.d.-VP16 transcription activator in the presence (+) or
absence (-) of estrogen.

FIG. 9 is a bar graph plotting SEAP activity (RU) vs. estrogen regulation of Beta ITAM-Fyn
SH2-dependent two-hybrid formation in mammalian cells observed by transient transfection (FIG.
9, panel A, columns 1-4 of the graph) or when using a stable cell line (FIG. 9, panel B, columns 5
and 6). The presence (+) or absence (-) of estrogen is indicated. The columns represent cells
transfected in duplicate with pMerSB (Example 1L) [Cols. 1 and 2] or with pMerSB and pMAF2
(Example 1P) [Cols. 3 and 4]. Columns 5 and 6 represent a stable cell line (prepared as described
in Example 2) containing both pMerSB and pMAF2 cultured in the absence (column 5) or presence
(column 6) of estrogen.

Detailed Description of the Invention

The present invention addresses current needs in the art of drug discovery by providing a
mechanism-based, cellular two-hybrid assay for identifying inhibitors of key molecules involved
in certain signal transduction pathways. This assay permits the determination of whether a test or
unknown compound is capable of blocking a specific phosphopeptide-mediated association, and
provides compositions and methods for identifying and functionally characterizing such inhibitors.
This invention permits the identification of inhibitors of the molecular association between an SH2,
PID or other phosphopeptide binding domain of a given protein and a naturally occurring ligand
therefor. The inhibitors so identified are themselves useful in therapeutic and pharmaceutical
compositions and regimens.

Among other features, this invention provides genetically engineered cells, preferably
eucaryotic cells, useful for high throughput identification of compounds which interfere with,
inhibit or otherwise impede interactions mediated by the binding of a tyrosine-phosphorylated
ligand with its phosphopeptide binding protein. The cells of this invention contain and are capable of
expressing a reporter gene and genes encoding two fusion proteins. One of the fusion proteins
contains a phosphopeptide binding domain, the other contains a ligand for that PBD. The two fusion
proteins are capable of binding to each other and, when they do so, of detectably activating the
transcription of the reporter gene.
One or both of the fusion proteins may also contain one or more optional domains or elements,
including a domain capable of regulatably sequestering the fusion protein at a desired cellular
location or compartment or in an inactive conformation.

In the practice of this invention, cells containing the components mentioned above are
cultured under suitable conditions and in a suitable culture medium permitting cell growth and a
detectable level of expression of the reporter gene. A test substance is added to the medium, the
cells are cultured for an appropriate incubation period, i.e. permitting an inhibitor to enter cells,
block two-hybrid formation or persistence and effect a detectable change in reporter gene
expression. The level of expression of the reporter gene is then measured. A decrease in the level of
expression of the reporter gene indicates that the test substance is a candidate inhibitor.

The assay may be repeated using a control cell containing the reporter gene and its associated
control elements and further containing and being capable of expressing a gene encoding a
transcription factor fusion protein as a positive control. The transcription factor fusion protein
contains both a DNA binding domain and a transcriptional activating domain permitting it to activate
transcription of the reporter gene. The DBD and TAD are typically the same domains which are
separately present in the respective DNA-binding and transcriptional activating fusion proteins,
previously described. Alternatively, the control cell may express, in place of the transcription
factor fusion protein, two alternative two-hybrid fusion proteins which were designed based on a
different protein binding pair than the phosphopeptide binding pair represented by the primary
two-hybrid assay cells as described above. The failure of the test substance to significantly alter
the level of expression of the reporter gene in such controls supports the conclusion that the test
substance is not generally inhibiting the assay or inhibiting the assay read-out via an irrelevant
mechanism. Such controls help confirm the phosphopeptide binding inhibitor status of candidate
inhibitors.

To facilitate understanding of this invention, the following definitions are provided:

A "peptide ligand" as used herein defines a polypeptide motif or fragment thereof which
contains one or more tyrosine residues, and which is capable of binding to a phosphopeptide binding
domain such as an SH2 or PID domain when at least one tyrosine residue of the peptide ligand is
phosphorylated. A peptide ligand may be an oligopeptide, polypeptide or protein fragment or
portion. (The terms "polypeptide" and "protein" are used interchangeably herein.)

A "phosphopeptide binding domain" (PBD) is an amino acid sequence or polypeptide region,
typically present in certain signaling proteins, which is capable of selective and specific binding to
its corresponding tyrosine-phosphorylated ligand. The binding of a tyrosine-phosphorylated ligand
to a PBD is considered to mediate or direct the association of the proteins and the transduction of the
signal.

A "phosphopeptide binding pair" (PBP) consists of a PBD and a peptide ligand therefor.
An "SH2 domain" is an example of a PBD. An "SH2 ligand" is an example of a peptide ligand.

A "phosphopeptide binding pair inhibitor" is a compound which can be shown to inhibit the
formation or persistence of binding between a PBD and a tyrosine-phosphorylated peptide ligand
therefor. A phosphopeptide binding pair inhibitor may function by binding to a PBD with
competitive avidity vis-a-vis a tyrosine-phosphorylated peptide ligand for the PBD, binding to a
tyrosine-phosphorylated peptide ligand for the PBD with competitive avidity vis-a-vis the PBD,
or otherwise disrupting the normal binding between a PBD and a tyrosine-phosphorylated peptide
ligand.

A "DNA Binding Domain" (DBD) is a polypeptide or protein domain, e.g. a portion of a
transcription factor, which binds to a DNA sequence, typically a transcription regulatory DNA
element linked to a target or reporter gene.






WO 97/31113 



PCT/US97/02635



A "Transcription Activator Domain" (TAD) is a polypeptide or protein domain, e.g. a portion
of a transcription factor, which when in proximity to transcription regulatory DNA elements of a
target gene, activates gene transcription (and is typically linked to a nuclear localization
sequence).

A "reporter gene" is a gene which, upon expression, produces a detectable gene product.

The term "DNA element" is used to refer to a transcription regulatory DNA sequence, typically
upstream of a target or reporter gene, which binds to the DBD and permits transcription in the
presence of an appropriate transcription factor or two-hybrid complex.

A "DNA binding fusion protein" is a fusion protein composed of at least one copy of a DBD fused
to at least one copy of a member of a PBP. A "Transcription activator fusion protein" is a fusion
protein composed of at least one copy of a TAD fused to at least one copy of the other member of a
PBP.

II. The Engineered Cells of the Invention

A. The Host Cells

Any eukaryotic cells which can be transformed with heterologous DNA and which can be grown
or maintained in culture may be used to prepare the novel engineered cells of this invention.
However, mammalian cells are presently preferred. Mammalian cells include, without limitation,
those of mouse, hamster, rat, rabbit, dog, cow or primate, including human, origin. They may be of
a wide variety of tissue types, including without limitation, mast cells, fibroblasts, osteoclasts,
osteoblasts, macrophages, neutrophils, other T cells, epithelial cells, endothelial cells, hepatic
cells, kidney cells, or other cell types and may be primary cells or cell lines.

Yeast cells may also be used as the host eukaryotic cells of this invention.

The host cells of this invention are engineered by the introduction therein of heterologous
DNA(s) encoding and capable of directing the expression of the fusion proteins as well as
heterologous DNA comprising a reporter gene under the transcription control of a DNA element
responsive to the "two-hybrid" complex of the DNA-binding fusion protein with the transcription
activator fusion protein.

The DNA molecule(s) encoding the two fusion proteins, as well as DNA comprising the
reporter gene and its associated DNA element, may be introduced into the host cells as separate
DNAs or as one or more DNAs linked together.

B. Reporter Gene Constructs 

Suitable reporter genes include any gene whose expression can be detected. Many such genes
are known to the art, including, without limitation, those encoding an enzyme which catalyzes a
reaction resulting in a detectable color change, an enzyme that permits continued cell growth or
viability in the presence of an otherwise toxic agent or absence of an otherwise essential nutrient,
a repressor or suppressor of a biological function which is activated or enhanced by the reduced
expression of the "reporter gene" or simply a gene product which can be conveniently detected.

Examples of reporter genes which may be used in the practice of this invention include,
without limitation, genes encoding enzymes such as luciferase, beta-galactosidase, alkaline
phosphatase and horseradish peroxidase. The proteins encoded by these genes catalyze reactions
generating readily detectable products. Alternatively, the reporter gene may encode a readily
detectable cell surface marker such as Thy1 or a secreted protein such as hGH for which there is a
convenient, and preferably commercially available, assay. One well known reporter gene, which is
exemplified herein, is the gene encoding secreted alkaline phosphatase (SEAP) [Berger, J. et al,
Gene, 66:1-10 (1988)].
The production of SEAP may be monitored from the culture media without the need to lyse the
cells. This permits convenient detection, using a conventional colorimetric or fluorescent assay, at
different time points during the assays of the invention.

The reporter gene is linked to a DNA element recognized by the DBD and permitting gene
expression in response to two-hybrid formation (i.e., complexation of the first and second fusion
proteins). The DNA element may comprise a synthetic promoter sequence consisting of multiple
DBD binding sites upstream of a TATA box element. The number of such binding sites can be
adjusted to optimize the overall level of reporter gene expression induced by the fusion proteins of
this invention. Examples of such binding sites include GAL4 binding sites [see e.g. Acheson, "DNA
Tumor Viruses", Cold Spring Harbor, NY (Tooze, J. ed., 1980) pp 151-160; Peden et al, Science,
209:1392-1396 (1980); Liu & Green, Cell, 61:1217-1224 (1990); and Ptashne & Gann,
Nature, 346:329-331 (1990)].

The DNA element of the reporter gene, when introduced into cells, will preferably not be
occupied in the absence of exogenously added DBD of the second fusion protein. The extent of
occupancy of the DNA element may be determined by measuring the level of reporter gene
transcription in the absence and presence of the fusion proteins, the kinase domain or the
transcription factor fusion protein.
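
The two measurements described above (reporter level with and without the fusion proteins) lend themselves to a simple fold-induction calculation, which can also guide the choice of the number of DBD binding sites. The Python sketch below is a hypothetical illustration: the function names and the layout of the `measurements` mapping are assumptions, and the criterion used here (highest fold induction over background) is only one of several reasonable ways to "optimize" a reporter construct.

```python
from typing import Dict, Tuple


def fold_induction(with_fusions: float, without_fusions: float) -> float:
    """Reporter signal with the fusion proteins relative to background occupancy."""
    if without_fusions <= 0:
        raise ValueError("background signal must be positive")
    return with_fusions / without_fusions


def best_site_count(measurements: Dict[int, Tuple[float, float]]) -> int:
    """Pick the binding-site count with the highest fold induction.

    measurements maps the number of DBD binding sites in a construct to a
    (signal with fusion proteins, signal without fusion proteins) pair.
    """
    return max(measurements, key=lambda n: fold_induction(*measurements[n]))
```

A construct with a high absolute signal but a high background may score worse than one with fewer sites and a cleaner baseline, which is why the ratio rather than the raw signal is compared.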

By way of illustration, one particular embodiment of the invention uses a reporter gene
construct containing DNA encoding SEAP linked to a DNA element containing iterated GAL4 binding
sites. Additional guidance in the use of suitable promoters for this reporter gene may be obtained
from Fields et al, US Patent No. 5,283,173 (1 Feb 1994) and Vasavada et al, Proc. Natl. Acad. Sci.
USA, 88:10686-10690 (1991).

C. Fusion Protein Constructs

The two fusion proteins of this invention typically contain either (a) one or more DNA
binding domains ("DBDs", which may be the same or different), or (b) one or more transcription
activation domains ("TADs", which may be the same or different), in addition to one or more
phosphopeptide binding domains or peptide ligands therefor. For instance, in one embodiment, the
first fusion protein has an SH2 domain or a tandem SH2 region (two SH2 domains) and a
transcription activation domain while the second fusion protein has a DNA binding domain and an
SH2 ligand domain, capable after tyrosine phosphorylation, of binding with the SH2 domain(s).

1st fusion protein: SH2 domain-TAD

2d fusion protein: DBD-SH2 ligand

In preferred embodiments, the second fusion protein contains a hormone ligand binding domain and
a protein tyrosine kinase domain linking the DBD and SH2 ligand domain, as discussed in further
detail below.
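
The domain composition of the two fusion proteins can be written down as a small data structure, which makes it easy to check that a proposed pair is complementary. The Python sketch below is illustrative only: the class `FusionProtein`, the domain labels ("DBD", "TAD", "PBD", "ligand", "HBD", "kinase") and the function `is_two_hybrid_pair` are hypothetical names chosen here, and the check ignores domain order, copy number and linkers.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class FusionProtein:
    """A fusion protein described only by the labels of its domains."""
    name: str
    domains: Tuple[str, ...]

    def has(self, domain: str) -> bool:
        return domain in self.domains


def is_two_hybrid_pair(a: FusionProtein, b: FusionProtein) -> bool:
    """True if one protein supplies the DBD and the other the TAD, and one
    supplies the PBD while the other supplies the peptide ligand."""
    dbd_tad_split = (a.has("DBD") and b.has("TAD")) or (a.has("TAD") and b.has("DBD"))
    pbp_split = (a.has("PBD") and b.has("ligand")) or (a.has("ligand") and b.has("PBD"))
    return dbd_tad_split and pbp_split


# The embodiment outlined above, with the optional hormone-binding and kinase domains.
first = FusionProtein("1st fusion protein", ("PBD", "TAD"))
second = FusionProtein("2d fusion protein", ("DBD", "HBD", "kinase", "ligand"))
```

Here the SH2 domain of the first fusion protein is labeled generically as a PBD, so the same check applies to a PID-based pair.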

The DBD and the TAD of the two fusion proteins typically originate from a transcription
factor. The DBDs and TADs may be derived from the same or different transcription factors and may
be optimized by genetic engineering. DBDs and TADs are often derived from transcription factors
having separable DNA-binding and transcription activation domains.

i. Transcription Factors 

A large number of transcription factors are known which require two subunits for activity,
that is, a single transcription factor can be divided into two separate functional domains (e.g. a TAD
and a DBD). While each domain is inactive (transcriptionally) by itself, the paired components,
when in close proximity, comprise an active transcription factor.
Transcription factors which can be used in this invention include yeast GAL4, which can be
divided into two domains [Ma and Ptashne, Cell, 48:847-853 (1987) and US Patent No.
5,283,173]. One may use, for instance, a fusion of GAL4(1-147)-SNF1 and
SNF4-GAL4(768-881), where the SNF1 and SNF4 may be replaced by the subject binding
proteins as binding domains. Combinations of GAL4 and VP16 or HNF-1 and VP16 can be employed.
Each of these proteins will have a linked partner, part of an interacting pair. The interaction
(binding partner recognition) will bring the TAD (VP16) to DNA (GAL4 or HNF-1) and
transcription will be induced. Other transcription factors that may be used for the method of this
invention include, but are not restricted to, the members of the Jun, Fos, and ATF/CREB families,
Oct1, Sp1, HNF-3, the steroid receptor superfamily, and the like. Other TADs and DBDs which may
be used in the practice of this invention are well known in the art.

The transcription factor can be endogenous or exogenous with respect to the desired host cells. 
If the transcription factors are exogenous, but functional within the host and can cooperate with the 
endogenous RNA polymerase (rather than requiring an exogenous RNA polymerase, for which a gene 
could be introduced), then an exogenous promoter element functional with the DBDs or TADs of the 
transcription factor can be provided with another construct for regulating transcription of the 
reporter gene. Thus, the initiation of transcription can be restricted to the gene(s) associated with 
the exogenous promoter region, i.e., the reporter gene(s). 

ii. DNA Binding Domains 

DNA-binding domains for use in this invention may be selected from a wide variety of DNA-binding 
domains known in the art. Examples include DNA-binding domains of the homeodomain 
class, the zinc-finger class, and the paired-box class, for which numerous examples are known in 
the literature. In cases where detailed information on the molecular contacts of the protein with 
DNA is available, the DNA recognition specificity of the protein may be engineered by amino acid 
substitutions. 

A preferred feature for DNA binding domains is the lack of interactions with other cellular 
proteins. Alternatively, the nature of such interactions should be known with precision, so that 
these interactions can be abrogated by suitable amino acid substitutions within the DNA-binding 
domain. In addition, the DNA binding domain will preferably be unique for the cell type utilized. The 
DNA binding element, when introduced into cells, will preferably not be occupied (at least not 
unduly) in the absence of exogenously added DNA binding domain. The level of background occupancy 
may be determined through appropriate controls as discussed previously. 

Cell-type specific factors (not found in the cells used for the in vivo assay), DNA binding 
domains with altered binding specificity and procaryotic DNA binding domains are well suited for 
the methods described. One suitable DNA-binding domain is derived from the Phox1 protein 
(Grueneberg, D. A., Natesan, S., Alexandre, C. and Gilman, M. Z. (1992). Human and Drosophila 
homeodomain proteins that enhance the DNA-binding activity of serum response factor. Science 257, 
1089-1095; Genbank accession number: M95929). This protein is a member of the homeodomain 
class. A 69 amino-acid domain derived from Phox1 is sufficient to bind DNA. High-affinity DNA 
recognition sequences for the protein have also been identified, as have amino acid substitutions 
that change its DNA recognition specificity and that affect its ability to interact with certain 
endogenous proteins in human cells (Grueneberg et al.). 

A second suitable DNA-binding domain is derived from the SRE-ZBP protein (Attar, R. M. and 
Gilman, M. Z. (1992). Expression cloning of a novel zinc-finger protein that binds to the c-fos 
serum response element. Mol. Cell. Biol. 12, 2432-2443; Genbank accession number: M88579). 
SRE-ZBP is a member of the C2H2 class of zinc-finger proteins. It has seven tandem zinc fingers. 
Any one of these zinc-finger domains or any combination of two or more domains can be used to 
generate a DNA-binding domain with novel recognition specificity. Furthermore, because the 
general structure and mode of DNA recognition is known for proteins of this class, DNA recognition 
can be directly modified if necessary. 

A third suitable DNA-binding domain is the GAL4 binding domain referred to previously. 

A fourth suitable type of DNA-binding domain is a composite DNA binding domain as described 
in published International Patent Application No. WO 96/20951 and exemplified by ZFHD1. See 
also Pomeranz et al., 1995, Science 267:93-96. 

iii. Transcription Activation Domains 

Preferred transcription activation domains are compact in size, potent in activity, and 
nontoxic to cells. Additional features become relevant for particular applications. For example, for 
constructs to be used in one particular type of cell or tissue, activator domains with especially high 
activity in the host cell are used. In applications that involve the stable integration of the target 
gene into host cell DNA, activator domains that resist chromatin suppression are used. 

Numerous activation domains are known in the art. Two illustrative activation domains which 
may be used in the practice of this invention are the VP16 activation domain and the NF-kB p65 
activation domain. See, e.g., International Patent Application Nos. WO 96/20951 and WO 96/41865. 

iv. Phosphopeptide Binding Pairs 

Each of the fusion proteins contains either at least one TAD or at least one DBD linked to at 
least one member of a PBP, i.e., to at least one phosphopeptide binding domain or peptide ligand 
therefor. One exemplary PBP is an SH2 domain and a peptide ligand therefor (e.g., a ZAP SH2 
domain and an immunoreceptor activation motif such as the TCR zeta ITAM). As noted previously, 
the ligand domain will contain one or more Tyr residues, which must be phosphorylated to bind to 
the SH2 domain. By way of further example, the peptide ligand domain may be an IgE ITAM, the 
polyoma middle T phosphorylation sequence or a synthetic consensus tyrosine phosphorylation 
binding site. 

In some embodiments, protein domains that are not generally regarded as influencing, either 
allosterically or otherwise, the behavior of the SH2 or PID or other such domain may be included 
to stabilize the folding, enhance expression or provide a means of identifying or manipulating 
the protein (e.g., fusion to glutathione-S-transferase or other domains that aid in 
identification/purification, epitope tags, etc.). 

As illustrated below, the fusion proteins may contain a plurality of PBDs or ligands 
therefor. For example, one embodiment of the present invention uses a fusion protein containing a 
tandem SH2 domain from ZAP-70 (containing two SH2 domains). 

Peptide ligand members of the PBP for use in the fusion proteins are defined above. For 
example, where one member of a binding pair is an SH2 domain, the other member, i.e., 
the peptide ligand, may be a naturally occurring ITAM containing at least one Tyr residue. Such ligands 
in their nonphosphorylated state will not bind the PBDs. These ligands must be phosphorylated by 
endogenous or (introduced) heterologous protein tyrosine kinase activity within the cell. 

A large number of SH2 domains, PIDs, and other phosphopeptide binding domains, as well as 
ligands therefor, are known in the art and may be adapted for use in this invention. The following 
additional background information and guidance may be of interest and use to the practitioner. 

Identification of SH2 or SH2-like Domains 

The term "SH2-like domain or a subdomain thereof" refers to a sequence which is 
substantially homologous to a Src homology region 2 (SH2 region), or a subdomain of an SH2 region, 
preferably a conserved region of an SH2 region. The Src homology region is a noncatalytic domain of 
~100 amino acids which was originally identified in the viral Fps and viral Src cytoplasmic 
tyrosine kinases by virtue of its effects on both catalytic activity and substrate phosphorylation (T. 
Pawson, Oncogene 3, 491 (1988) and I. Sadowski et al., Mol. Cell. Biol. 6, 4396 (1986)). SH2 
domains have been found in a variety of eukaryotic proteins, some of which function in 
intracellular signal transduction. Many are known in the art. Examples (including counterparts 
from various species) of SH2 domain-containing proteins include (1) members of the src-family 
protein tyrosine kinases (Src, Lyn, Fyn, Lck, Hck, Fgr, Yes), (2) Shc, (3) Tsk, (4) Btk, (5) VAV, 
(6) Grb2, (7) Crk, and (8) signal transducer and activator of transcription (STAT) proteins. In addition, a 
number of proteins, such as ZAP-70, p85 phosphatidylinositol 3' kinase (PI3K), Syk, GTPase 
Activating Protein (GAP), and Phospholipase C gamma, have two SH2 domains. SH2 domain-containing 
proteins have been identified in human, rodent, sheep, bovine, C. elegans, Drosophila, 
Xenopus, flatworm, freshwater sponge, and hydra. 

One way to identify new SH2 or SH2-like domains from unknown DNA, RNA or protein 
sequence is by using one of many available computer alignment programs. One example is pfscan, 
which can be run via the World Wide Web (WWW) site at 
http://ulrec3.unil.ch/software/profilescan.html. To use the program, a protein sequence is tested 
against a "profile" describing the SH2 domain motif. According to the program information, the 
particular strength of profiles is that they can be used to describe very divergent protein motifs. 
These profiles are normally derived from multiple alignments of the initial sequence set. In 
addition to the sequences themselves, a profile identifies which types of residues are allowed at 
what position within the domain, which amino acids are conserved, which ones are not, which 
positions or regions can allow insertions, and which regions may be dispensable. Additional 
information on pfscan and PROSITE can be obtained at the web page 
http://ulrec3.unil.ch/index.html operated by the Bioinformatics Group at the ISREC (Swiss 
Institute for Experimental Cancer Research). 

As an example, we analyzed the peptide sequence of human Src with the pfscan program. The 
results are shown below. The program clearly identified the SH2 domain of Src as encompassing the 
region from amino acids 150-247 of the Src peptide sequence. In addition, the SH3 and kinase 
domains were identified by pfscan. 

NScore    raw    from-to         Profile | Description 
26.9695   1792   pos. 150-247    PS50001 | SH2 Src homology 2 (SH2) domain 
20.2947   1182   pos. 63-144     PS50002 | SH3 Src homology 3 (SH3) domain 
43.4246   2912   pos. 269-522    PS50011 | PROTEIN_KINASE_DOM Protein kinase 

The NScore of a match is the negative decadic logarithm of the expected number of matches of the 
given quality (or better) in a random database of the given size. When the expected number of 
matches is much less than 1, it converges to the probability of finding the match in the database. 
Since the number of expected matches depends on the size of the database, the decadic logarithm 
of the database size must be subtracted before the calculation: 

-log(NExp) = NScore - log(DBsize) 

where NExp = expected number of chance matches and DBsize = size of the database in 
characters. 

The following table gives some examples of how to convert the NScores into probabilities for 
the SwissProt database and the nonredundant (nr) protein database. The calculation is based on a 
database size of 

18,531,385 residues for SwissProt (log=7.27) 
58,154,119 residues for the nr database (log=7.76) 






WO 97/31113 



PCT/US97/02635 



NScore    Expected chance matches in: 
          SwissProt    nonredundant 

 7.0      1.8          5.8 
 7.5      0.58         1.82 
 8.0      0.18         0.58 
 8.5      0.058        0.182 
 9.0      0.018        0.058 
 9.5      0.006        0.0182 
10.0      0.0018       0.0058 
10.5      0.0006       0.0018 

...and so on... 

A segment of a test sequence is considered to contain an SH2 domain if it has an SH2 profile 
NScore value > 7.5, preferably > 8, more preferably > 9, more preferably > 10. 
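
The conversion from NScore to expected chance matches can be sketched in a few lines of Python. This is only an illustration of the relation -log(NExp) = NScore - log(DBsize) given above; the function names and the dictionary are assumptions introduced here and are not part of pfscan. The log values 7.27 and 7.76 are the ones quoted in the text.

```python
# Illustrative only: names below are hypothetical, not part of pfscan.
# log10 of database size, as quoted in the text.
LOG_DB_SIZE = {"SwissProt": 7.27, "nr": 7.76}


def expected_chance_matches(nscore, log_db_size):
    """Solve -log10(NExp) = NScore - log10(DBsize) for NExp."""
    return 10 ** (log_db_size - nscore)


def passes_sh2_threshold(nscore, threshold=7.5):
    """Apply the NScore cutoff stated in the text (> 7.5 by default)."""
    return nscore > threshold


if __name__ == "__main__":
    for nscore in (7.0, 8.0, 9.0, 10.0):
        row = [expected_chance_matches(nscore, v) for v in LOG_DB_SIZE.values()]
        print(nscore, ["%.4f" % x for x in row], passes_sh2_threshold(nscore))
```

A stricter cutoff (for example threshold=9) can be passed where fewer chance matches are acceptable.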

As a second example, the N-terminal 160 amino acid sequence from human ZAP-70 was 
applied to pfscan. The result indicated an SH2 domain bounded by amino acids 10-102. 

NScore    raw    from-to        Profile | Description 
16.4402   1082   pos. 10-102    PS50001 | SH2 Src homology 2 (SH2) domain 

The minimal segment encoding an SH2 or SH2-like domain as determined by sequence 
alignment may not be enough to function or function optimally in the assay. Additional sequences 
either N-terminal and/or C-terminal of this minimal segment may be necessary or desirable for 
optimizing the binding of the encoded protein to a peptide ligand within the context of the invention. 
These additional sequences may be derived from the natural protein sequence or may be derived 
from other proteins or even non-natural sequences that are added to the expression vector. 
However, the sequence required can be readily determined biochemically, in an in vitro binding 
assay, or genetically, using the production of SEAP as an indicator of SH2 domain-phosphopeptide 
interaction. 

SH2 domains can be identified using other computer alignment programs, such as MegAlign 
within the DNAstar computer package (Madison, WI). To do this, one or more known SH2 domains 
and a test sequence are aligned by the Clustal method. A sequence having ≥ 25%, in some cases 
30-50%, in other cases > 50%, amino acids identical to a known SH2 domain is identified as an SH2 
homology domain. The positions of identical amino acids between the test sequence and different 
known SH2 domains can vary, except for one position. All SH2 domains identified to date have a 
conserved arginine residue approximately 25-40 residues from the start of the SH2 homology 
domain. In human src this arginine is found within the sequence FLVRES, where abbreviations for 
the amino acid residues are: F, Phe; L, Leu; V, Val; R, Arg; E, Glu; S, Ser. 
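
The two criteria just described (percent identity to a known SH2 domain, and an arginine roughly 25-40 residues into the domain) can be checked with a short Python sketch. It assumes the two sequences have already been aligned by some external program and are supplied as equal-length strings with "-" for gaps. The function names, and the choice to count identity over gap-free columns only, are assumptions made for illustration and are not taken from MegAlign.

```python
# Illustrative only: helper names and the identity denominator are assumptions.

def percent_identity(aligned_a, aligned_b):
    """Percent of gap-free aligned columns in which both residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


def has_conserved_arginine(domain_seq, window=(25, 40)):
    """True if an Arg (R) lies within the 1-based residue window of the domain."""
    start, end = window
    return "R" in domain_seq[start - 1:end]


def looks_like_sh2(aligned_test, aligned_known, domain_seq, min_identity=25.0):
    """Combine both criteria from the text; thresholds are adjustable."""
    return (percent_identity(aligned_test, aligned_known) >= min_identity
            and has_conserved_arginine(domain_seq))
```

The 25% default mirrors the lowest threshold named in the text; a caller could raise min_identity to 30 or 50 for the more stringent cases.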

Another way to identify SH2 or SH2-like domains is by running a query in the federated 
nucleotide or protein databases for the SH2 domain feature. In the SWISS-PROT database, this is 
listed under the "FT" or "feature" heading. The SWISS-PROT database can be accessed over the WWW at 
EBI http://www.ebi.ac.uk. For example, in the file listed for human Src (P12931), the region 
containing the SH2 domain is shown to be 150-247. 

SWISS-PROT: P12931 
ID   SRC_HUMAN      STANDARD;      PRT;   535 AA. 
AC   P12931; 
DR   MIM; 190090; 
DR   PROSITE; PS00107; PROTEIN_KINASE_ATP. 
DR   PROSITE; PS00109; PROTEIN_KINASE_TYR. 
DR   PROSITE; PS50001; SH2. 
DR   PROSITE; PS50002; SH3. 
DR   PROSITE; PS50011; PROTEIN_KINASE_DOM. 
DR   PRODOM [Domain structure / List of seq. sharing at least 1 domain] 
DR   SWISS-2DPAGE; GET REGION ON 2D PAGE. 
KW   TRANSFERASE; TYROSINE-PROTEIN KINASE; PROTO-ONCOGENE; PHOSPHORYLATION; 
KW   ATP-BINDING; MYRISTYLATION; SH3 DOMAIN; SH2 DOMAIN. 
FT   INIT_MET      0      0       BY SIMILARITY. 
FT   LIPID         1      1       MYRISTATE (BY SIMILARITY). 
FT   DOMAIN       63    144       SH3. 
FT   DOMAIN      150    247       SH2. 
FT   DOMAIN      269    522       PROTEIN KINASE. 
FT   NP_BIND     275    283       ATP (BY SIMILARITY). 
FT   BINDING     297    297       ATP (BY SIMILARITY). 
FT   ACT_SITE    388    388       BY SIMILARITY. 
FT   MOD_RES     419    419       PHOSPHORYLATION (AUTO-) (BY SIMILARITY). 
FT   MOD_RES     529    529       PHOSPHORYLATION (BY SIMILARITY). 
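
Domain boundaries can be pulled out of such a record programmatically. The Python sketch below assumes the feature-table layout in which each FT line carries, in order, the code FT, a feature key, a start position, an end position and a description. The function name and the small example record are hypothetical and serve only to illustrate the idea.

```python
# Illustrative only: find_domains and EXAMPLE_RECORD are hypothetical names.

def find_domains(record_text, name):
    """Return (from, to) pairs for FT DOMAIN lines whose description equals name."""
    hits = []
    for line in record_text.splitlines():
        fields = line.split()
        if len(fields) >= 5 and fields[0] == "FT" and fields[1] == "DOMAIN":
            description = " ".join(fields[4:]).rstrip(".")
            if description == name:
                hits.append((int(fields[2]), int(fields[3])))
    return hits


# Assumed layout: code, key, from, to, description.
EXAMPLE_RECORD = """\
FT   DOMAIN       63    144       SH3.
FT   DOMAIN      150    247       SH2.
FT   DOMAIN      269    522       PROTEIN KINASE.
FT   MOD_RES     419    419       PHOSPHORYLATION (AUTO-) (BY SIMILARITY).
"""

if __name__ == "__main__":
    print(find_domains(EXAMPLE_RECORD, "SH2"))
```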



Yet another way to identify SH2 or SH2-like domains is by screening a 
cDNA expression library with a phosphorylated peptide ligand for a known SH2 domain to isolate 
cDNAs for SH2 proteins. One could use PCR or low stringency screening with an SH2-specific 
probe. The SH2 domain or protein containing the SH2 domain may be isolated from naturally 
occurring sources (e.g., cells, tissues, organs, etc.); produced recombinantly in bacteria, yeast or 
eukaryotic cells; produced in vitro using cell-free translation systems; or produced synthetically 
(e.g., peptide synthesis). 

Certain SH2 or SH2-like domains may not be identified via the pfscan program nor exhibit 
sufficient homology with known SH2 domain sequences to be detected by computer alignment 
programs. These sequences may, nevertheless, exhibit the same or similar three-dimensional 
structure as known SH2 domains and function as an SH2-like domain to bind 
phosphotyrosine-containing peptides or proteins. The three-dimensional structures of several 
known SH2 domains have been determined. SH2 domains are characterized as two anti-parallel beta 
sheets composed of 5 or 6 beta strands. Regions forming an alpha helix may or may not be present 
within the domain. SH2 or SH2-like domains may be recognized as having an SH2-like domain 
structure when solved by x-ray crystallography or NMR spectroscopy. Alternatively, a predicted 
structure by homology modeling may be used to identify a particular protein sequence as an 
SH2-like domain. 

The alignment of SH2 domains used to generate the SH2 profile for pfscan, as taken from 
http://ulrec3.unil.ch/prf_details/alignments/SH2.msf (profile matrix can be obtained from 
http://ulrec3.unil.ch/cgi-bin/get_pstprf?SH2), is based on alignment of approximately 390 SH2 
domains from proteins of various species. The list of proteins containing SH2 domains used in the 
alignments in the Swiss-Prot Database includes the following (P##### is the Swiss-Prot 
Database Accession number): 

P00519 ABL1_HUMAN     P00520                P00521 ABL_MLVAB 
P00522 ABL_DROME      P00523 SRC_CHICK      P00524 SRC_RSVSR 
P00525 SRC_AVISR      P00526 SRC_RSVP       P00527 
P00528 SRC1_DROME     P00530 FPS_FUJSV      P00541 FPS_AVISP 
P00542 FES_FSVGA      P00543 FES_FSVST      P00544 FGR_FSVGR 
P03949                P05433 GAGC_AVISC     P05480 SRCN_MOUSE 
P06239                P06240                P06241 FYN_HUMAN 
P07332 FES_HUMAN      P07947                P07948 
P08103                P08487 PIP4_BOVIN     P08630 SRC2_DROME 
P08631                P09324 YES_CHICK      P09769 
P09851 GTPA_BOVIN     P10447 ABL_FSVHY      P10686 
P10936 YES_XENLA      P12931                P13115 
P13116 SRC2_XENLA     P13406 FYN_XENLA      P14084 
P14085 SRC_AVIST      P14234 FGR_MOUSE      P14238 
P15054 SRC_AVIS2      P15498 VAV_HUMAN      P16277 
P16333                P16591 FER_HUMAN      P16879 
P16885 PIP5_HUMAN     P17713 STK_HYDAT      P18106 
P19174 PIP4_HUMAN     P20936 GTPA_HUMAN     P23615 
P23726 P85B_BOVIN     P23727 P85A_BOVIN     P24135 
P24604 TEC_MOUSE      P25020 SRC_RSVH1      P25911 
P26450 P85A_MOUSE     P27446 FYN_XIPHE      P27447 
P27870 VAV_MOUSE      P27986 P85A_HUMAN     P29349 
P29350 PTN6_HUMAN     P29351 PTN6_MOUSE     P29353 
P29354 GRB2_HUMAN     P29355                P31693 
P32577                P34265 YKF1_CAEEL     P35235 
P35991                P39688 FYN_MOUSE      P40763 
P41239 CSK_CHICK      P41240                P41241 
P41242 CTK_MOUSE      P41243 CTK_RAT        P41499 
P42224 STA1_HUMAN                           P42226 
P42227 STA3_MOUSE     P42228 STA4_MOUSE     P42229 
P42230 STA5_MOUSE     P42231 STA5_SHEEP     P42232 
P42679 CTK_HUMAN      P42680 TEC_HUMAN      P42681 
P42682 TXK_MOUSE      P42683                P42684 
P42685                P42686 SRK1_SPOLA     P42687 
P42688 SRK2_SPOLA     P42689 SRK3_SPOLA     P42690 
P43403 ZA70_HUMAN     P43404 ZA70_MOUSE     P43405 
P46108                P46109 CRKL_HUMAN     Q00655 
Q02977 YRK_CHICK      Q03526 ITK_MOUSE      Q04205 
Q04736 YES_MOUSE      Q04929 CRK_CHICK      Q05876 
Q06124 PTNB_HUMAN     Q06187 BTK_HUMAN      Q07014 
Q07883 GRB2_CHICK     Q08012                Q08881 

A general method to identify an SH2 domain within a test peptide or nucleotide sequence follows: 

1. Translate the cDNA or RNA into single-letter-code protein sequence. This can be accomplished 
using a computer program such as DNA Strider or EditSeq in the DNAstar package. 

2. Go to the WWW site at http://ulrec3.unil.ch/software/profilescan.html 

3. Copy the test sequence into the appropriate box in the pfscan form. 

4. Submit the form to the pfscan server. 

5. The results are sent back through the web browser or via e-mail. 
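
Step 1 of this method (translating a nucleotide sequence into single-letter protein code) can be illustrated without any particular software package. The following Python sketch uses the standard genetic code and reads a single frame starting at the first base; it is not the DNA Strider or EditSeq implementation, and the names used are assumptions introduced for the example.

```python
# Illustrative only: a minimal single-frame translator using the standard genetic code.
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides):
    """Translate DNA or RNA (frame 1) to protein, stopping at the first stop codon."""
    seq = nucleotides.upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons containing ambiguous bases are reported as X.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("TTTCTGGTTCGTGAATCT"))
```

The resulting string is what would be pasted into the pfscan form in step 3.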

The subject invention is relevant to SH2 and SH2-like domains as described in the foregoing 
paragraphs. Using information provided herein, and by analogy to the examples provided below, one 
may carry out this invention with any SH2 domain, SH2-like domain, PID or PID-like domain and 
a peptide ligand therefor, e.g., in place of ZAP, Syk, Src or Fyn SH2 domains. 

Identification of PID or PID-like Domains 

An alternative phosphotyrosine binding domain to SH2 domains is the so-called 
phosphotyrosine interaction domain (PID). This domain, containing on average about 160 amino 
acid residues, was originally identified in the Shc protein. In contrast to SH2 domains, which 
recognize sequences having a consensus pTyr-Xaa-Xaa-Xaa-Xaa (a phosphotyrosine followed by 
three or more amino acids), PID domains recognize sequences with the consensus 
Asn-Pro-Xaa-pTyr (also called NPXY in single letter code). The invention described in this application is also 
relevant to PID and PID-like domains. In this case, the coding sequence for a PID domain is 
substituted in the appropriate vector for the SH2 domain coding sequence and a ligand that 
recognizes the PID domain replaces the SH2 domain ligand. Phosphorylation of the PID ligand could 
be accomplished using v-Src, as described herein. Alternative protein kinases could be used to 
phosphorylate the PID ligand. In addition, a protein kinase endogenous within the cell could catalyze 
phosphorylation of the PID ligand. 
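
Candidate PID ligand sites can be located in a peptide sequence by searching for the NPXY consensus named above. The Python sketch below is a simple pattern search under that assumption; the function name is hypothetical, and a sequence match says nothing about whether the tyrosine is actually phosphorylated in the cell.

```python
# Illustrative only: a plain pattern search for the NPXY consensus.
import re

NPXY = re.compile(r"NP.Y")


def find_npxy_sites(peptide):
    """Return 1-based positions of the Tyr residue in each NPXY match."""
    return [m.start() + 4 for m in NPXY.finditer(peptide.upper())]


if __name__ == "__main__":
    # Hypothetical peptide used only to exercise the function.
    print(find_npxy_sites("AAANPQYAAGGNPTYLL"))
```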

Significant information concerning these domains is known in the art. A detailed description 
of the PID domains can also be found on the WWW at the site 
http://www.bork.embl-heidelberg.de/Modules/pid-gif.html. The following information is taken from that site: 

Documentation - PROSITE description 

Beside SH2, the phosphotyrosine interaction domain (PI domain or PID) [3] is the second 
phosphotyrosine-binding domain found in the transforming protein Shc [1,2]. Shc couples 
activated growth factor receptors to a signalling pathway that regulates the proliferation of 
mammalian cells and it might participate in the transforming activity of oncogenic tyrosine 
kinases. The PI domain specifically binds to the Asn-Pro-Xaa-Tyr(P) motif found in many 
tyrosine-phosphorylated proteins including growth factor receptors. PID has also been found in 
the Shc-related protein Sck [1] and several otherwise unrelated regulatory proteins [3] which 
are listed below. 

• Mammalian Shc (46 kD and 52 kD isoforms) contains one N-terminal PID, a collagen-like 
  domain and a C-terminal SH2 domain. 

• Human Shc-related protein Sck contains one PI domain and an SH2 domain. 

• Mammalian X11 is expressed prominently in the nervous system. It contains 2 disc 
  homologous regions (DHR) of about 100 AA downstream of the PID. 

• Drosophila nuclear Numb protein is required in determination of cell fate during sensory 
  organ formation in Drosophila embryos. It has one PID. 

• Caenorhabditis hypothetical protein F56D2.1 contains an N-terminal metalloproteinase 
  domain followed by one PID. 

• Rat FE65. The WW domain as well as the 2 PIDs found in the sequence of FE65 indicate that 
  this protein is probably involved in signal transduction. 

• Drosophila protein disabled is a cytoplasmic, tyrosine-phosphorylated protein found in 
  CNS axons and body wall muscles. It is involved in embryonic neural development. It 
  contains one N-terminal PI domain. 

• Mouse mitogen responsive phosphoprotein isoforms P96, P93 and P67, which are produced 
  by alternative splicing, contain one N-terminal PID. This is also true for the differentially 
  expressed human ortholog Doc-2. 

• Human EST05045 protein fragment has one PID. 

References: 

[1] Kavanaugh W.M., Williams L.T. Science 266:1862-1865 (1994) 
[2] Blaikie P. et al. J. Biol. Chem. 269:32031-32034 (1994) 
[3] Bork P., Margolis B. Cell 80:693 (1995) 

A PI domain alignment based on approximately 40 PI domains from various species is illustrated in 
the WWW site at http://ulrec3.unil.ch/prf_details/alignments/PID.msf. An alignment based on 
approximately 50 PID sequences is shown at the web site at 
http://www.bork.embl-heidelberg.de/Modules/pi-ali.html. 

The minimal segment encoding a PID or PID-like domain as determined by sequence alignment 
may not be enough to function or function optimally in the assay. Additional sequences either 
N-terminal and/or C-terminal of this minimal segment may be necessary or desirable for optimizing 
the binding of the encoded protein to a peptide ligand within the context of the invention. These 
additional sequences may be derived from the natural protein sequence or may be derived from 
other proteins or even non-natural sequences that are added to the expression vector. However, the 
sequence required can be determined biochemically, in an in vitro binding assay, or genetically, 
using the production of SEAP as an indicator of PID domain-phosphopeptide interaction. 

v. Optional Domains 

One or both of the fusion proteins may also contain one or more optional domains, including a 
protein kinase domain, as mentioned above, or a domain capable of regulatably rendering the fusion 
protein unavailable for 2-hybrid formation (e.g., by sequestering the fusion protein at a desired 
cellular location or compartment or by maintaining the fusion protein in an inactive 
conformation). 

Kinase Domains. As one example, if the cell does not have an endogenous protein kinase 
capable of phosphorylating the tyrosine residues of the peptide ligand for the SH2, PID or other 
phosphopeptide binding domain, a heterologous kinase activity is introduced into the cell. A DNA 
encoding such a heterologous kinase may be introduced into the cell as an independent transcription 
unit (i.e., with its own transcription regulatory elements, such as a promoter and/or enhancer 
sequence). Alternatively, a kinase activity may be introduced as a functional kinase domain within 
one of the two fusion proteins of this invention. In one embodiment the fusion proteins comprise a 
first chimera containing one or more SH2 domains linked to one or more transcription activation 
domains and a second chimera containing a DBD-kinase domain-peptide ligand domain fusion. 

In one embodiment, the protein kinase domain from the v-Src protein was used to 
phosphorylate the peptide ligand linked to the DNA binding domain. Alternative protein kinases may 
be used to conduct this phosphorylation. The recognition of protein kinase domains can be conducted 
by the same approaches described for SH2 or PID domains using pfscan, ClustalV analysis, 
hybridization, or functional biochemical analysis. A segment defining the protein kinase homology 
region is approximately 280 amino acids. A detailed alignment of protein kinase domains can be 
obtained at the pfscan web site listed above or at the protein kinase web site at University of 
California at San Diego (http://www.sdsc.edu/kinases/). The minimal segment encoding a protein 
kinase domain as determined by sequence alignment is bounded by a conserved glycine in the 
N-terminal portion and a conserved arginine at the C-terminal portion. However, additional sequences 
either N-terminal and/or C-terminal of this minimal segment may be necessary or desirable for 
the encoded protein to functionally phosphorylate the peptide ligand within the context of the 
invention. These additional sequences may be derived from the natural protein kinase sequence or 
may be derived from other proteins or even non-natural sequences that are added to the expression 
vector. The sequence required can be readily determined biochemically, in an in vitro 
phosphorylation reaction, or genetically, using the production of SEAP as an indicator of SH2 (or 
PID) domain-phosphopeptide interaction. For domains that recognize phosphothreonine- or 
phosphoserine-modified peptides, an appropriate protein kinase would be substituted for the v-Src 
kinase domain. 

Localization domains. As another example, fusion proteins of this invention may also 
contain a targeting sequence providing for translocation of the protein to the nucleus. Such a 
targeting sequence may have a plurality of basic amino acids, referred to as a bipartite basic repeat 
[see, e.g., Garcia-Bustos et al., Biochim. Biophys. Acta 1071:83-101 (1991)]. This sequence can 
appear in any portion of the molecule internal or proximal to the N- or C-terminus and results in 
localization of the fusion protein within the nucleus. 

Nuclear Hormone Binding Domains. Compounds introduced into cells may be labile and 
therefore not present for long periods of time once introduced into the cell. In such cases, it would 
be advantageous to have all components present in the cell at the time the cells are exposed to test 
compositions. We therefore have made constructs encoding fusion proteins containing a nuclear 
hormone ligand binding domain (HBD). Numerous such hormone receptors and ligands are known 
which may be used in the practice of this invention, including estrogen, glucocorticoid, retinoic 
acid, aldosterone and vitamin D receptors. The regulatory properties of a nuclear hormone ligand 
binding domain (Picard et al., Cell (1988) 54, 1073-1080; Eilers et al., Nature (1989) 340, 
65-68) allow the post-translational regulation of interaction of the fusion proteins. By way of 
illustration, an estrogen receptor domain serves as an autonomous regulatory domain upon fusion 
to a heterologous protein; the steroid receptor domain subjects the chimeric protein to hormonal 
control. This regulatory property is due to the hormone-reversible interactions of the HBD with 
heat shock proteins. For example, a PBD-TAD fusion protein and a DBD-HBD-kinase-PBD ligand 
fusion protein can be expressed in cells containing the reporter gene construct. Only with the 
addition of steroid hormone will the fusion proteins be able to interact. In the absence of hormone, 
the HBD-containing fusion protein will not be competent to interact with its complementary fusion 
protein to form the two-hybrid complex. 
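
The dependence of reporter expression on hormone, ligand phosphorylation and the absence of an inhibitor can be summarized as a simple truth table. The Python sketch below is a deliberately idealized, all-or-none model written for illustration; the function and parameter names are assumptions, and real reporter output would be graded rather than Boolean.

```python
# Illustrative only: an idealized Boolean model of the hormone-gated two-hybrid assay.
from itertools import product


def reporter_expressed(hormone_present, ligand_phosphorylated, inhibitor_bound):
    """Reporter is on only if the two fusion proteins can form the complex."""
    # The HBD-containing fusion is competent to interact only when hormone is added.
    hbd_fusion_competent = hormone_present
    # The PBD binds its ligand only if the ligand is phosphorylated and not blocked.
    pbd_binds_ligand = ligand_phosphorylated and not inhibitor_bound
    return hbd_fusion_competent and pbd_binds_ligand


if __name__ == "__main__":
    print("hormone  phospho  inhibitor  reporter")
    for state in product([False, True], repeat=3):
        print(*state, reporter_expressed(*state))
```

In this model a test compound that blocks the phosphopeptide interaction turns the reporter off even in the presence of hormone, which is the readout the assay relies on.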

Other domains that may optionally be included within the fusion proteins include an antibody 
recognition sequence (epitope tag) permitting detection using antibodies. Such monitoring can be 
useful to confirm that the fusion protein is made and that the level of expression is not affected by 
compounds tested for inhibition of phosphopeptide domain binding. 

III. Assembly of Fusion Protein Constructs, Reporter Gene Plasmids and 
Genetically Engineered Cells of the Invention 

Constructs encoding the fusion proteins and containing the reporter genes of this invention 
can be introduced into the cells as one or more DNA molecules or constructs, in many cases in 
association with one or more markers to allow for selection of host cells which contain the 
construct(s). The constructs can be prepared in conventional ways, where the coding sequences and 
regulatory regions may be isolated, as appropriate, ligated, cloned in an appropriate cloning host, 
analyzed by restriction or sequencing, or other convenient means. Particularly, using PCR, 
individual fragments including all or portions of a functional unit may be isolated, where one or 
more mutations may be introduced using "primer repair", ligation, in vitro mutagenesis, etc. as 
appropriate. For example, a DNA sequence encoding the DNA-binding domain or TAD(s) is joined to 
DNA encoding the appropriate member of the PBP. These sequences are joined such that they 
constitute a single open reading frame that can be translated in cells into a single polypeptide 
harboring all intended domains. The order and arrangement of the domains within the polypeptide 
can vary. The construct(s), once completed and demonstrated to have the appropriate sequences, may 
then be introduced into a host cell by any convenient or desired means. The constructs may be 
incorporated into vectors capable of episomal replication (e.g., BPV or EBV vectors) or into vectors 
designed for integration into the host cells' chromosomes. The constructs may be integrated and 
packaged into non-replicating, defective viral genomes like Adenovirus, Adeno-associated virus 
(AAV), or Herpes simplex virus (HSV) or others, including retroviral vectors, for infection or 
transduction into cells. Alternatively, the construct may be introduced by protoplast fusion, 
electroporation, biolistics, calcium phosphate transfection, lipofection, microinjection of DNA or 
the like. The host cells will in some cases be grown and expanded in culture before introduction of 
the construct(s), followed by the appropriate treatment for introduction of the construct(s) and 
integration of the construct(s). It may be most convenient to use simple transfection procedures. 
The cells will then be expanded and screened by virtue of a marker present in the constructs. 
Various markers which may be used successfully include hprt, neomycin resistance, thymidine 
kinase, hygromycin resistance, etc., and various cell-surface markers such as Tac, CD8, CD3, 
Thy1 and the NGF receptor. 

In some instances, one may have a target site for homologous recombination, where it is 
desired that a construct be integrated at a particular locus. For example, one can delete and/or 
replace an endogenous gene (at the same locus or elsewhere) with a recombinant target construct of 
this invention. For homologous recombination, one may generally use either Ω or O-vectors. See,
for example, Thomas and Capecchi, Cell (1987) 51, 503-512; Mansour, et al., Nature (1988)
336, 348-352; and Joyner, et al., Nature (1989) 338, 153-156.

The constructs may be introduced as a single DNA molecule containing all of the genes, or
different DNA molecules having one or more genes. The constructs may be introduced 
simultaneously or consecutively, each with the same or different markers. 

Vectors containing useful elements such as bacterial or yeast origins of replication, 
selectable and/or amplifiable markers, promoter/enhancer elements for expression in
procaryotes or eucaryotes, and mammalian expression control elements, etc. which may be used to
prepare stocks of construct DNAs and for carrying out transfections are well known in the art, and
many are commercially available.

Additional background information and general guidance to the practitioner with respect to 
design, assembly, incorporation into plasmids, and transfection of constructs for such fusion
proteins and reporter genes is available in the following published international patent
applications: WO 94/18317, WO 95/02684, WO 96/20951, WO 95/24419 and WO 96/41865,
the contents of which are incorporated herein by reference.

IV. The Assay Method of the Invention 

Genetically engineered cells containing the various DNA constructs described above may be
used to carry out the assay method of this invention. Such assays permit one to identify the
presence in a test composition of an inhibitor of the binding of a PBP with a
tyrosine-phosphorylated ligand therefor. Test compositions or compounds to be assessed for their
inhibitory activity can be obtained from a variety of sources, including, for example, microbial
broths, cellular extracts, conditioned media from cells, synthetic compounds and combinatorial
libraries, and may be tested individually or in pools. The assay method of this invention may be
used to screen natural product and test compound libraries or structurally-biased diversity
libraries to identify desired inhibitors. The test composition may be selected from a mixture of one
or more test peptides, wherein said mixture is provided in the form of a library of synthetic
peptides or in the form of a phage library displaying the various peptides.

A. The General Method

The method involves the use of a cell as described above, particularly a cell in which the 
second fusion protein also contains a heterologous kinase domain or the cell contains an endogenous 
or heterologous kinase capable of tyrosine-phosphorylating the ligand domain. According to this
method, the above-described engineered cells are cultured or maintained in a conventional culture
medium under suitable conditions permitting growth of the cells both in the presence and absence of
the test composition. For example, the cell is cultured in standard tissue culture media containing
the drugs necessary to select for cells which stably retain the introduced genes described above.
Cells may be cultured in standard tissue culture dishes, e.g. multidishes and microwell plates, or in
other vessels or arrays of vessels, as desired.

The test compounds are added to the culture media to assay their effect on PBP interaction.
Cells at this time may be bathed in tissue culture media (with or without serum) or a balanced salt
solution. The compounds to be tested may be cell permeable and therefore added directly to cells.
Alternatively, it may be necessary to first make the cells permeable using streptolysin O,
tetanolysin or another cell permeabilizing agent.

During incubation and growth of the cells in the absence of an inhibitor test composition, the
cell is capable of expressing the above recited fusion proteins and reporter gene product. Normally
this involves the DNA-binding fusion protein binding to the DNA element of the reporter gene.
Tyrosine residues in the peptide ligand domain are phosphorylated by the kinase activity
(endogenous or heterologous) in the cell. The phosphorylated peptide ligand, e.g., an ITAM, binds its
PBD on the other fusion protein, e.g., an SH2 domain [see, e.g., Pawson, US Patent No. 5,352,660
(4 October 1994)] to generate the two-hybrid complex. The end result is measurable expression
of the detectable reporter gene product.

Thus, an additional step of this method is determining whether production of the detectable
gene product was diminished in the presence of the test composition. After transcription is induced,
cells or media are evaluated to measure reporter gene expression. In some configurations, the assay
is conducted in 96 well plates or other arrays, and the reporter gene expression is measured in
situ. In other cases, cells or media are harvested and an extract prepared (if necessary) for
evaluation. Comparison of the measurements of detectable reporter gene product in the presence
and absence of the test composition permits the identification of the test composition as an inhibitor
of the PBP interaction. Transcription of the detectable gene product induced due to PBP interactions
is assayed using standard colorimetric or fluorescence assays.

For example, if the test compound is not an inhibitor of the formation and/or stability of the
PBP complex, the measurement of detectable reporter gene product in the presence and absence of
the test composition will be about equivalent, since the results of a noninhibitor will be similar to
results obtained with no test compound, as described above. If the test compound is a putative
inhibitor of the PBP interaction, a decrease in the level of reporter gene expression is observed. This
decrease indicates that the test compound inhibited the binding of the PBD to its phosphorylated peptide
ligand, and thus inhibited the formation or persistence of the two-hybrid complex. The reporter gene
expression is thus extinguished or reduced, depending on the amount of test composition present in
the culture and the strength of the binding between the test compound and either member of the
PBP.
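
The comparison of reporter measurements in the presence and absence of a test composition can
be sketched as a short calculation. The following is a minimal illustration only: the function
names, the background subtraction and the 50% cutoff are assumptions made for the example and
are not specified in this description.

```python
def percent_inhibition(signal_with_compound, signal_without_compound, background=0.0):
    """Percent decrease in reporter signal attributable to the test composition.

    All inputs are raw reporter readings (e.g. colorimetric or fluorescence units).
    The background term is an assumed blank-well reading.
    """
    control = signal_without_compound - background
    if control <= 0:
        raise ValueError("control signal must exceed background")
    treated = signal_with_compound - background
    return 100.0 * (1.0 - treated / control)


def is_putative_inhibitor(signal_with_compound, signal_without_compound,
                          background=0.0, threshold_pct=50.0):
    """Flag a composition whose reporter signal falls by at least threshold_pct.

    threshold_pct is a hypothetical cutoff chosen for illustration.
    """
    inhibition = percent_inhibition(signal_with_compound,
                                    signal_without_compound, background)
    return inhibition >= threshold_pct
```

A noninhibitor gives readings that are about equivalent with and without the composition, so its
percent inhibition is near zero; a putative inhibitor gives a value that rises with the amount of
composition present.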

For example, in one embodiment of this invention, test compounds are screened in eukaryotic
host cells for blocking/inhibiting the interaction between the ZAP-70 tandem SH2 domain with the
TCR zeta ITAM in vivo (see Example 3). Compounds that block ZAP SH2 binding to ITAM sequence
are detected using a first fusion protein comprising the vSrc kinase domain, estrogen receptor
domain and the TCR zeta ITAM ligand fused to a DNA-binding domain and a second fusion protein
comprising the tandem SH2 domain of ZAP fused to a TAD. The interaction of the ZAP SH2 domains
with the zeta ITAM domain brings the activator domain in proximity to the DNA element, inducing
reporter gene transcription. Compounds that block ZAP SH2-zeta ITAM interaction reduce
reporter gene expression.

In this embodiment, and by way of illustration, the assay method of this invention offers
several advantages over traditional T cell assays. The assay readout is directly dependent on the
interaction between the particular SH2 ligand and the SH2 domain-containing polypeptide (e.g., the
ZAP-70 tandem SH2 domain). In contrast, in standard T cell assays that measure cell
proliferation, cytolytic activity or cytokine production, there are many steps between activation
and the assay endpoints. Therefore, many nonspecific inhibitors score in standard T cell assays,
which will not score in the assay of this invention.

B. Eliminating Non-Specific Inhibition 

To exclude test compounds that inhibit reporter gene expression by nonspecific mechanisms 
(e.g., by having a general inhibitory effect on cellular transcription or translation), test
compounds are also assayed for their effect on reporter gene expression when reporter gene
transcription is not dependent on PBP interactions. For this purpose, an indicator cell line is
utilized wherein the transcription activation domain is brought directly to DNA by the DBD (by
expressing in cells a DBD/transcription activation domain fusion). To rule out false positives based
on otherwise nonrelevant effects of a test composition relating to the HBD, the fusion protein may
further contain a hormone binding domain (HBD). In such cases, the only difference between the
positive control indicator cell line and the assay cell line is the dependence of the assay cell line
upon PBP complex formation for reporter gene expression.

General inhibitory effects on transcription are also revealed by performing radiolabeled
uridine pulse labelings during exposure to the test compound. Inhibition of general incorporation of
radiolabel indicates a general inhibitory effect on transcription.
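
The logic of pairing the assay cell line with a PBP-independent indicator cell line can be
sketched as follows. This is an illustrative outline under stated assumptions: the function name,
the category labels and both cutoffs (50% for a hit, 20% tolerated loss in the control line) are
hypothetical and are not taken from this description.

```python
def classify_hit(assay_inhibition_pct, control_inhibition_pct,
                 hit_threshold_pct=50.0, control_tolerance_pct=20.0):
    """Sort a test compound by its effect in the assay line and the control line.

    assay_inhibition_pct:   percent loss of reporter signal in the PBP-dependent line.
    control_inhibition_pct: percent loss in the indicator line, where the DBD/TAD
                            fusion drives the reporter without a PBP interaction.
    Both thresholds are assumed values used only for illustration.
    """
    if assay_inhibition_pct < hit_threshold_pct:
        return "inactive"
    if control_inhibition_pct >= control_tolerance_pct:
        # Reporter expression also falls when no PBP interaction is required,
        # which suggests a general effect on transcription or translation.
        return "nonspecific"
    return "candidate specific inhibitor"
```

A compound that reduces reporter expression in both lines is set aside as nonspecific, while one
that acts only in the PBP-dependent line is carried forward.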

C. Simultaneous Exposure of the Cell to the Assay Components

In another aspect of this method, the first fusion protein transfected into the cell contains an
HBD as described above. The HBD is an important feature of this aspect of the method of use of this
cell. Test compounds introduced into cells may be labile and therefore not present for long periods
of time once introduced into the cell. It would be advantageous to have all interacting components
present in the cell at the time the cells are exposed to potential inhibitors of the PBP interaction.

Inhibitors of phosphopeptide binding protein-protein interactions could exert their effect in
two ways. They could compete with the ligand/HBD/TAD domain for the PBD or displace the
ligand/HBD/TAD, which has complexed with the PBD. Mechanistically, it will be easier to block
than displace the interaction between the binding domain and its ligand. Potential inhibitors, i.e.,
test compounds, can be introduced into cells prior to complex formation (hormone addition) to
block complex formation.



D. Cell-Free Assay

In another embodiment of this method, cell-free transcription systems prepared from cells
as described above or prepared from the purified components described above may be used in lieu of
the engineered cells. In a cell-free implementation, peptide ligands may be synthetic or
nonsynthetic peptides, peptoids or small molecules identified from libraries, cell broths, natural
extracts, etc. The method of the assay is the same as above, except that it is performed in a cell-free
environment. In this case, the kinase domain should preferably be part of the fusion protein 
containing the DBD and the ligand domain for the phosphopeptide binding domain. 

V. The Identified Phosphopeptide Binding Pair Inhibitors 

A PBP inhibitor identified by the methods of this invention selectively binds to a PBD of
interest, such as an SH2 domain. Such an inhibitor can block or inhibit protein-protein,
protein-peptide, protein-nucleotide, protein-polynucleotide, protein-lipid, protein-carbohydrate
or protein-small molecule interactions mediated by a phosphopeptide binding domain of interest.
Once an agent has been identified as a phosphopeptide binding or blocking/inhibiting agent, it
can be produced using known methods, such as by recombinant methods of protein production or
chemical synthesis. It can also be obtained from the source in which it was initially identified.

A. Counterscreens

Having identified a PBP inhibitor by means of the above assay, rapid and high throughput
counterscreens using cells expressing a monomeric transcription factor (i.e. a positive control
fusion protein containing one or more copies of both the DBD and TAD) that drives reporter gene
expression independent of a protein kinase can readily identify nonspecific inhibitors, or confirm
inhibitor specificity. Test compounds identified as inhibitors by the method of this invention may
be further evaluated for binding activity with respect to one or more additional PBDs of interest,
or with respect to additional proteins containing the domain(s), using various approaches, a
number of which are well known in the art.

The counterscreen assays may include all of the core features of the above-described general
assay of this invention, but the fusion proteins are altered to determine whether the inhibitor
binds to other ligands, or other domains. The readout is independent of the particular PBD peptide
and peptide ligand used in the original screen. Such counterscreens may be repeated as described to
establish the specificity of the inhibitor. Therefore, any nonspecific inhibitors (e.g., of RNA
transcription or protein synthesis) can be identified.

For instance, such identified inhibitors may be evaluated for activity as competitive
inhibitors of the binding of an SH2 domain with a phosphorylated ligand thereto [e.g. Pawson, US
Patent No. 5,352,660 (4 October 1994)]. As another example, the counterscreen assay may use a
control cell containing and capable of expressing the reporter gene and its associated control
elements and a gene encoding a control fusion protein. The control fusion protein contains both the
DBD and the TAD which are separately present in the respective DNA binding and transcription
activator fusion proteins, previously described. Alternatively, two fusion proteins may be used,
each containing one half of a binding partner with different binding specificities than those found
for PBD and the protein-protein interaction assayed as described above. The failure of the test
compound to significantly alter the level of expression of the reporter gene confirms its activity as
an inhibitor.

Another counterscreen assay useful in evaluating test compounds shown to be inhibitors by
the method of this invention for binding to one or more additional PBDs of interest is surface
plasmon resonance (BIAcore®) technology [see, e.g., Panayotou et al., Mol. Cell. Biol., 13:
3567-3576 (1993)].

In another counterscreen, a plasmid that drives the expression of a nuclear located tyrosine
phosphatase, linked to the estrogen receptor ligand binding domain, can be introduced into the cells.
A number of proteins have had their function rendered steroid hormone-dependent by fusion to the
steroid hormone receptor ligand binding domain. The hormone binding domain of steroid hormone
receptors acts as a molecular switch. In the absence of cognate ligand, the fusion protein is inactive.
However, hormone binding leads to the very rapid (within seconds or minutes) "gain of function" of
the chimeric protein. Heterologous proteins made hormone-dependent for function include both
enzymes and transcription factors. Thus, with the addition of estrogen, detectable protein
production would be curtailed, due to the dephosphorylation of the peptide ligand. Such cells could
be used to screen for phosphatase inhibitors that block the reduction of reporter production that
would occur when the cells are exposed to estrogen.

In a further possible counterscreen, compounds may be tested in an assay as described 
herein but replacing the PBD and peptide ligand domains with SH3 and SH3 ligand domains as
described in WO 95/24419. Compounds should not be active unless they inhibit a non PBP-related
component of the assay system or also inhibit SH3-mediated protein-protein interactions.

The inhibitors identified in the assay system of this invention can be further evaluated by 
conventional methods for possible therapeutic applications, toxicological and pharmacological
activity. For example, test compounds so identified as inhibitors may further be evaluated for
activity in inhibiting cellular or other biological events mediated by a pathway involving the
PBD-based interaction of interest using a suitable cell-based assay or an animal model. Cell-based
assays and animal models suitable for evaluating inhibitory activity of a test compound with
respect to a wide variety of cellular and other biological events are known in the art. New assays
and models are regularly developed and reported in the scientific literature.

By way of nonlimiting example, compounds which bind to an SH2 domain involved in the
transduction of a signal leading to asthma or allergic episodes may be evaluated in a mast cell or
basophil degranulation assay. The inhibitory activity of a test compound identified as an SH2
inhibitor by the method of this invention with respect to cellular release of specific mediators such
as histamine, leukotrienes, hormonal mediators and/or cytokines, as well as its biological activity
with respect to the levels of phosphatidylinositol hydrolysis or tyrosine phosphorylation can be
characterized with conventional in vitro assays as an indication of biological activity. [See, e.g.,
Edward L. Barsumian et al., Eur. J. Immunol., 11:317-323 (1981); M. J. Forrest, Biochem.
Pharmacol., 42:1221-1228 (1991) (measuring N-acetyl-beta-glucosaminidase from activated
neutrophils); and V. M. Stephan et al., J. Biol. Chem., 267:5434-5441 (1992)].

For example, histamine release can be measured by a radioimmunoassay using a kit available 
from AMAC Inc. (Westbrook, ME). One can thus evaluate the biological activity of inhibitors
identified by the method of this invention and compare them to one another and to known active
compounds or clinically relevant compounds which can be used as positive controls.

Generally speaking, in such assays IC50 scores of 150-300 µM are considered of interest,
scores of 50-150 µM are considered good, and scores below about 50 µM are of high interest.
Prior to in vivo models, inhibitors identified by this invention may also be tested in an ex vivo
assay for their ability to block antigen-stimulated contraction of sensitized guinea pig tracheal
strip tissue. Activity in this assay has been shown to be useful in predicting the efficacy of
potential anti-asthma drugs.
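
The IC50 bands given above can be expressed as a small lookup. This is a sketch only: the
function name is hypothetical, and the handling of the exact boundary values (50 and 150 µM) and
of scores above 300 µM is an assumption, since the text gives the ranges only approximately.

```python
def ic50_tier(ic50_um):
    """Map an IC50 score in micromolar onto the interest bands described in the text.

    Boundary handling is assumed: 50 uM falls in "good" and 150 uM in "of interest".
    """
    if ic50_um <= 0:
        raise ValueError("IC50 must be positive")
    if ic50_um < 50:
        return "high interest"
    if ic50_um < 150:
        return "good"
    if ic50_um <= 300:
        return "of interest"
    return "outside the ranges given"
```

For instance, a compound scoring 100 µM would fall in the "good" band and one scoring 10 µM in
the "high interest" band.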

Numerous animal models of asthma have been developed and can be used [for reviews, see
Larson, "Experimental Models of Reversible Airway Obstruction", in THE LUNG: Scientific
Foundations, Crystal, West et al. (eds.), Raven Press, New York, pp. 953-965 (1991); Warner
et al., Am. Rev. Respir. Dis., 141:253-257 (1990)]. Species used in animal models of asthma
include mice, rats, guinea pigs, rabbits, dogs, sheep and primates. Other in vivo models available
are described in Cross et al., Lab. Invest., 63:162-170 (1990), and Koh, et al., Science,
256:1210-1213 (1992).

By way of further example, compounds identified as PBP inhibitors by the method of this
invention which bind to a PBD involved in the transduction of a signal involved in the initiation,
maintenance or spread of cancerous growth may be evaluated in relevant conventional in vitro and
in vivo assays. See, e.g., Ishii et al., J. Antibiot., XLII:1877-1878 (1989); and US Patent
5,206,249 (issued 27 April 1993).

VI. Uses of Inhibitors Identified by This Invention

Inhibitors identified by this invention may be used as biological reagents in assays as 
described herein for functional classification of a PBD of a particular protein, particularly a newly 
discovered protein. Families or classes of PBD-bearing proteins may now be defined functionally,
with respect to ligand specificity. Moreover, inhibitors identified by this invention can be used to
inhibit the occurrence of biological events resulting from molecular interactions mediated by a
PBP. Inhibiting such interactions can be useful in research aimed at better understanding the
biology of PBP-mediated events.

Such binding or blocking agents would be useful, for example, in the diagnosis, prevention or
treatment of conditions or diseases resulting from cellular processes mediated by a PBP
interaction. For example, a patient can be treated to prevent the occurrence or progression of
osteoporosis or to reverse its course by administering to the patient in need thereof an SH2 binding
or blocking agent which selectively binds Src SH2.

There are many other conditions for which phosphopeptide binding or blocking agents may be
useful therapeutically, including, e.g., breast cancer where the SH2 domain-containing proteins
Src, PLCgamma and Grb7 have been implicated. Other relevant conditions include prostate cancer,
in which case targeting Grb2, PLCg, and PI3K, all of which contain SH2 domains, may be useful in
treatment or prevention of the disease. Inhibition of the interaction of Grb2 or Abl SH2 domains
with Bcr-abl may be useful to treat chronic myelogenous leukemia (CML) or acute myelogenous
leukemia (AML).

Still other relevant applications of a PBP inhibitor would be to prevent interferon-, growth
factor-, or cytokine-mediated diseases (e.g. inflammatory diseases) by targeting the PBDs of STAT
proteins. Agents that block the SH2 domains of ZAP-70, which is believed to be involved in
activation of T-cells, would be useful in the treatment of autoimmune diseases. An inhibitor that
blocks one or both SH2 domains of ZAP-70 would also be useful as an immunosuppressant to
prevent rejection of skin and organ transplants.

By virtue of the capacity to inhibit protein-protein interactions required for cellular events
of pharmacologic importance, PBD/peptide ligand interaction inhibitors identified by the method
may be used in pharmaceutical compositions and methods for treatment or prevention in a subject
in need thereof. Such inhibitors can be used to treat or prevent the diseases or their pathological
effects mediated by such interactions.

For example, drugs that completely block one of the two ZAP SH2 domains should effectively 
prevent ZAP from associating with the activated TCR and thus block T cell activation. A ZAP 
antagonist or inhibitor would specifically inhibit T cells and avoid the toxicity of the currently
used immunosuppressive drugs, FK506 and cyclosporin, which target the more ubiquitously
expressed protein, calcineurin. Since calcineurin is required for cellular activities in several
tissues in addition to T cells, cyclosporin and FK506 cause side effects in the kidney and central
nervous system which limit their application largely to patients with organ transplant rejection.

VII. Pharmaceutical Compositions and Methods

A. Compositions

Inhibitors identified by this invention can be formulated into pharmaceutical compositions 
containing a therapeutically (or prophylactically) effective amount of the inhibitor in admixture 
with a pharmaceutically acceptable carrier and/or other excipients (i.e., pharmaceutically
acceptable organic or inorganic carrier substances suitable for parenteral administration) using
conventional materials and means. Such a carrier includes but is not limited to saline, buffered
saline, dextrose, water, glycerol, ethanol, and combinations thereof. The carrier and composition
can be sterile. The formulation should suit the mode of administration.

The composition, if desired, can also contain minor amounts of wetting or emulsifying agents,
or pH buffering agents. The composition can be a liquid solution, suspension, emulsion, tablet, pill,
capsule, sustained release formulation, or powder. The composition can be formulated as a
suppository, with traditional binders and carriers such as triglycerides. Oral formulation can
include standard carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium
stearate, sodium saccharine, cellulose, magnesium carbonate, etc.

In a specific embodiment, the composition is formulated in accordance with routine 
procedures as a pharmaceutical composition adapted for intravenous administration to human 
beings. Typically, compositions for intravenous administration are solutions in sterile isotonic 
aqueous buffer. Where necessary, the composition may also include a solubilizing agent and a local
anesthetic to ease pain at the site of the injection. Generally, the ingredients are supplied either
separately or mixed together in unit dosage form, for example, as a lyophilized powder or
water-free concentrate in a hermetically sealed container such as an ampoule or sachette indicating the
quantity of active agent. Where the composition is to be administered by infusion, it can be 
dispensed with an infusion bottle containing sterile pharmaceutical grade water or saline. Where 
the composition is administered by injection, an ampoule of sterile water for injection or saline 
can be provided so that the ingredients may be mixed prior to administration. 

Topical compositions include a pharmacologically acceptable topical carrier, such as a gel, an
ointment, a lotion, or a cream, which includes, without limitation, such carriers as water,
glycerol, alcohol, propylene glycol, fatty alcohols, triglycerides, fatty acid esters, or mineral oils.
Other topical carriers include liquid petroleum, isopropyl palmitate, polyethylene glycol, ethanol
(95%), polyoxyethylene monolaurate (5%) in water, or sodium lauryl sulfate (5%) in water.
Other materials such as antioxidants, humectants, viscosity stabilizers, and similar agents may 
be added as necessary. 

Materials and methods for producing the various formulations are well known in the art (see
e.g. US Patent Nos. 5,182,293 and 4,837,311).

B. Methods

The invention provides methods of treating, preventing and/or alleviating the symptoms
and/or severity of a disease or disorder referred to above by administration to a subject of a PBP
inhibitor in an amount effective therefor. The subject will be an animal, including but not limited
to animals such as cows, pigs, chickens, etc., and is preferably a mammal, and most preferably
human. By "mammals" is meant rodents such as mice, rats and guinea pigs as well as dogs, cats,
horses, cattle, sheep, nonhuman primates and humans. Such effective amounts can be readily
determined by evaluating the inhibitors identified by this invention in conventional assays
well-known in the art, including assays described herein.

Administration of such composition may be by any conventional route using appropriate
formulations as are well known in this art. Various delivery systems are known and can be used to
administer the inhibitor, e.g., encapsulation in liposomes, microparticles, microcapsules. One
mode of delivery of interest is via pulmonary administration. Other methods of introduction include
but are not limited to intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous,
intranasal, epidural, nasal and oral routes. The inhibitor may be administered by infusion or bolus
injection, by absorption through epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and
intestinal mucosa, etc.) and may be administered together with other biologically active agents.

Administration can be systemic or local. For treatment or prophylaxis of nasal, bronchial or
pulmonary conditions, preferred routes of administration are oral, nasal or via a bronchial aerosol
or nebulizer. In specific embodiments, it may thus be desirable to administer the inhibitor locally
to the area in need of treatment; this may be achieved by, for example, and not by way of limitation,
local infusion during surgery, topical application, by injection, by means of a catheter, by means
of a suppository, or by means of a skin patch or implant, said implant being of a porous,
nonporous, or gelatinous material, including membranes, such as sialastic membranes, or fibers.

Administration to an individual of an effective amount of the inhibitor can also be
accomplished topically by administering the compound(s) directly to the affected area of the skin of
the individual. In certain instances, it is expected that the inhibitor may be disposed within devices
placed upon, in, or under the skin. Such devices include patches, implants, and injections which
release the compound into the skin, by either passive or active release mechanisms.

The amount of the inhibitor which will be effective in the treatment or prevention of a
particular disorder or condition will depend on the nature of the disorder or condition, and can be
determined by standard clinical techniques. In addition, in vitro or in vivo assays may optionally be
employed to help identify optimal dosage ranges. Effective doses may be extrapolated from
dose-response curves derived from in vitro or animal model test systems. For example, a typical
effective dose of the inhibitor is in the range of about 0.01 to about 50 mg/kg, preferably about
0.1 to about 10 mg/kg of mammalian body weight, administered in single or multiple doses.
Generally, the inhibitor may be administered to patients in need of such treatment in a daily dose
range of about 1 to about 2000 mg per patient.
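
The arithmetic relating the per-kilogram ranges to a total amount can be sketched as follows. The
sketch only restates the figures quoted above (about 0.01 to 50 mg/kg, preferably about 0.1 to
10 mg/kg, and about 1 to 2000 mg per patient per day); the constant and function names are
hypothetical, and the body weight used in any call is an assumed input.

```python
# Ranges quoted in the text; names are illustrative only.
TYPICAL_MG_PER_KG = (0.01, 50.0)
PREFERRED_MG_PER_KG = (0.1, 10.0)
DAILY_MG_PER_PATIENT = (1.0, 2000.0)


def dose_window_mg(body_weight_kg, preferred=False):
    """Return the (low, high) total dose in mg implied by the per-kg range."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = PREFERRED_MG_PER_KG if preferred else TYPICAL_MG_PER_KG
    return (low * body_weight_kg, high * body_weight_kg)


def within_daily_range(total_mg):
    """Check a total amount against the quoted daily range per patient."""
    low, high = DAILY_MG_PER_PATIENT
    return low <= total_mg <= high
```

For an assumed 70 kg body weight, the preferred per-kilogram range corresponds to roughly 7 to
700 mg, which lies within the quoted daily range per patient.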

The precise dosage level of the inhibitor, as the active component(s), should be determined by
the attending physician or other health care provider and will depend upon well known factors,
including the phosphopeptide binding interaction under consideration, the route of administration,
and the age, body weight, sex and general health of the individual; the nature, severity and clinical
stage of the disease; and the use (or not) of concomitant therapies.

C. Kits

The invention also provides a pharmaceutical pack or kit comprising one or more containers 
filled with one or more of the ingredients of the pharmaceutical compositions of the invention.
Optionally associated with such container(s) can be a notice in the form prescribed by a
governmental agency regulating the manufacture, use or sale of pharmaceutical or biological 
products, which notice reflects approval by the agency of manufacture, use or sale for human 
administration. 

Other components such as physiologically acceptable surfactants (e.g., glycerides),
excipients (e.g., lactose), carriers, and diluents may also be included.

The following examples illustrate various aspects of this invention. These examples do not
limit the scope of this invention which is defined by the appended claims. 

Examples 

EXAMPLE 1: PLASMIDS

A. Plasmid pYSZ, a GAL4 (1-147)-vSrc kinase-Zeta ITAM yeast expression vector.

Plasmid pGTB9 (Clontech Laboratories) was used for the expression of GAL4 (1-147)-vSrc
kinase-Zeta ITAM in yeast. This plasmid has a 2 micron origin of replication and sequences for
Amp (bacteria) or Trp (yeast) selection. pGTB9 also has ADH1 regulatory sequences that direct the
expression of the GAL4 DBD fusion proteins. Downstream of GAL4 coding sequences are multiple
restriction endonuclease cloning sites.

The vSrc gene (Genbank Accession #J02342) kinase domain was introduced downstream of
GAL4 sequences in pGTB9 as an EcoRI/BamHI fragment after PCR amplification. The 5' and 3'
oligonucleotides, respectively, utilized for PCR were: (EcoRI end) cgGAATTCtccaagccccagaccca [SEQ
ID NO:1] and (BamHI end) gcGGATCCctcagcgacaccaaca [SEQ ID NO:2]. These oligos amplify
residues 248-526 of vSrc. PCR generated vSrc kinase DNA was sequenced after cloning to check
the fidelity of the amplification process.
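
The restriction sites built into the two primers can be checked with a few lines of code. The following is a minimal sketch, not part of the original disclosure: the helper names `find_site` and `check_primers` and the dictionary layout are illustrative assumptions, while the primer sequences and the EcoRI (GAATTC) and BamHI (GGATCC) recognition sequences are those given above.

```python
# Illustrative check (hypothetical helpers, not from the original disclosure):
# confirm that each PCR primer carries the intended restriction site.

# Recognition sequences of the two enzymes named in the text.
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC"}

# Primers exactly as listed above, keyed by sequence identifier.
PRIMERS = {
    "SEQ ID NO:1": ("EcoRI", "cgGAATTCtccaagccccagaccca"),
    "SEQ ID NO:2": ("BamHI", "gcGGATCCctcagcgacaccaaca"),
}


def find_site(primer: str, enzyme: str) -> int:
    """Return the 0-based index of the enzyme's recognition site, or -1."""
    return primer.upper().find(SITES[enzyme])


def check_primers(primers=PRIMERS) -> dict:
    """Map each primer name to the position of its expected site."""
    return {name: find_site(seq, enzyme) for name, (enzyme, seq) in primers.items()}
```

In both primers the site begins at index 2, that is, after a two-base 5' extension; a result of -1 would flag a primer lacking its intended site.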

Sequences encoding human Zeta chain cytoplasmic residues (Genbank Accession # J041[...],
residues 52-164) cloned into pET23b (Invitrogen) were excised using NdeI and HindIII. The
restriction fragment encoding Zeta residues was blunt ended using T4 DNA polymerase and ligated to
blunt ended SalI digested pGTB9-vSrc kinase DNA. The recombinant DNA was digested with
restriction endonucleases to identify a clone which contained Zeta coding sequences in the proper
orientation.

B. pYAZ22, a GAL4 activation domain-tandem ZAP SH2 yeast expression vector.

Plasmid pGAD424 (Clontech Laboratories) was used for the expression of GAL4 activation
domain-tandem ZAP SH2 fusion protein in yeast. pGex2KT-huZAP(1-259), containing residues
1-259 of human ZAP-70 (Genbank Accession # L05148), was digested with BamHI and blunt ended
using T4 DNA polymerase. After the addition of EcoRI linkers (New England Biolabs, catalog
1020), the nucleotide sequence encoding the tandem SH2 domains of ZAP (residues 1-259) was
isolated as an EcoRI fragment and cloned into the EcoRI site of pGAD424.

The recombinant DNA was digested with restriction endonucleases to identify a clone which
contained ZAP SH2 domain coding sequences in the proper orientation.

C. pYerSZ, an estrogen regulated GAL4 (1-147)-vSrc kinase-Zeta ITAM yeast expression
vector.

Human estrogen receptor ligand binding domain (l.b.d.) coding sequences (residues
282-595) were amplified by PCR from pHuER (Genbank Accession # M12674) using 5' and 3'
oligonucleotide primers, respectively, (EcoRI end) cgGAATTCtctgctggagacatgagagct [SEQ ID NO:3]
and (EcoRI end) cgGAATTCgactgtggcagggaaaccct [SEQ ID NO:4]. After digesting the amplified DNA
using restriction endonuclease EcoRI, estrogen receptor l.b.d. sequences were cloned into EcoRI cut
pYSZ (Example 1A). A recombinant clone was isolated that had estrogen receptor sequences in the
right orientation. This new plasmid, pYerSZ, directs the expression of GAL4 DBD-estrogen
receptor l.b.d.-vSrc kinase-Zeta ITAM fusion protein in yeast.

D. pMSZ, a GAL4 (1-147)-vSrc kinase-Zeta ITAM mammalian cell expression vector.

pBXG1, a pECE72-based vector [Sadowski, I. and Ptashne, M., Nucl. Acids Res., 17:7539
(1989)], was used for the expression of GAL4 DBD fusion proteins in mammalian cells. pECE72
[Ellis, L. et al., Cell, 45:721-732 (1986)] has the SV40 virus origin of replication, SV40 early
promoter and SV40 polyadenylation regulatory sequences. GAL4 (1-147) and multiple cloning
sites C-terminal to GAL4 coding sequences were obtained from pSKGAL147 [Kakidani, H. and
Ptashne, M., Cell, 52:161-167 (1988)] as a HindIII/XbaI fragment and inserted into HindIII/XbaI
cut pECE72 to generate pBXG1.

Sequences encoding vSrc kinase-Zeta ITAM were excised from pYSZ (Example 1A) as an
EcoRI/BglII fragment and cloned into EcoRI/BamHI digested pBXG1 to generate plasmid pMSZ,
capable of directing the expression of GAL4 (1-147)-vSrc kinase-Zeta ITAM in mammalian cells.

E. pMAZ22, a Herpes Virus VP16 activation domain-tandem ZAP SH2 domain mammalian
cell expression vector.

pMVN1 (Ivan Sadowski, University of British Columbia, Vancouver, CA) was used for the
production of transcription activation domain-tandem ZAP SH2 domain fusion proteins in
mammalian cells. pMVN1 contains the SV40 early promoter, HSV TK translation leader
sequence, SV40 nuclear localization sequences and VP16 activation domain residues followed by
multiple cloning sites for the construction of novel fusion proteins. The sequences encoding the
tandem SH2 domains of ZAP (residues 1-259) were excised from pYAZ22 (Example 1B) as an
EcoRI fragment and cloned into EcoRI digested pMVN1 in the proper orientation.

F. pMerSZ, a GAL4 (1-147)-estrogen receptor l.b.d.-vSrc kinase-Zeta ITAM
mammalian cell expression vector.

Human estrogen receptor l.b.d. coding sequences were prepared as described in 1C above.
After digesting the amplified DNA using EcoRI, estrogen receptor l.b.d. sequences were cloned
into EcoRI-cut pMSZ (Example 1D). A recombinant clone was isolated that had estrogen receptor
sequences in the right orientation. This new plasmid, pMerSZ, directs the expression of GAL4
DBD-estrogen receptor l.b.d.-vSrc kinase-Zeta ITAM fusion protein in mammalian cells.

G. pYSB, a GAL4 (1-147)-vSrc kinase-Beta ITAM yeast expression vector.

pGTB9-vSrc kinase DNA (see Example 1A) was engineered for the production of GAL4
DBD-vSrc kinase-Beta ITAM chain fusion protein. Beta sequences (Genbank Accession #
S21154) were amplified from pCD8-beta [Rivera, V. and Brugge, J., Mol. Cell. Biol.,
15:1582-1590 (1995)] using the 5' and 3' oligonucleotides, respectively:
(BamHI end) gcGGATCCggagctggggaagaactca [SEQ ID NO:5] and (SalI end)
acgcGTCGACttataaatcaatgggaggag [SEQ ID NO:6]. The resulting amplified DNA, after digestion with
BamHI and SalI, was cloned into BamHI/SalI digested pGTB9-vSrc kinase.

H. pYerSB, a GAL4 (1-147)-estrogen receptor l.b.d.-vSrc kinase-Beta ITAM yeast
expression vector.

The same human estrogen receptor l.b.d. coding sequences were prepared and amplified by
PCR as described in 1C above and cloned into EcoRI cut pYSB (Example 1G). A recombinant clone
was isolated that had estrogen receptor sequences in the right orientation. This new plasmid,
pYerSB, directs the expression of GAL4 DBD-estrogen receptor l.b.d.-vSrc kinase-Beta ITAM
fusion protein in yeast.

I. pYAS32, a GAL4 activation domain-Src SH3/SH2 yeast expression vector.

pGAD424 (Clontech Laboratories) was used for the production of transcription activation
domain-human Src SH3/SH2 domain fusion proteins in yeast cells. Sequences encoding the Src
SH3/SH2 domain (residues 84-249) [Tanaka, A. and Fujita, D.J., Mol. Cell. Biol., 6:3900-3909
(1986)] were amplified by PCR using the 5' and 3' oligonucleotides, respectively:
(EcoRI end) cctcacGAATTCggtggagtgaccacctttgtggcc [SEQ ID NO:7] and (BamHI end)
ccactcGGATCCgccggggcacacggtggtgaggc [SEQ ID NO:8]. After restriction endonuclease digestion using
EcoRI and BamHI, the PCR product was ligated to EcoRI/BamHI digested pGAD424.

J. pYAS2, a GAL4 activation domain-Src SH2 yeast expression vector.

pGAD424 was used for the production of transcription activation domain-human Src SH2
domain fusion proteins in yeast cells. Sequences encoding the Src SH2 domain (residues
144-249) [see Tanaka, A. and Fujita, D.J. (1986), cited above] were amplified by PCR using the
5' and 3' oligonucleotides, respectively:

(EcoRI end) cctcacGAATTCggcgactccatccaggctgaggag [SEQ ID NO:9] and (BamHI end)
ccactcGGATCCgccggggcacacggtggtgaggc [SEQ ID NO:10].

After digestion with EcoRI and BamHI, the PCR fragment was cloned into EcoRI/BamHI
digested pGAD424.

K. pMSB, a GAL4 (1-147)-vSrc kinase-Beta ITAM chain mammalian cell expression
vector.

pYSB (Example 1G) was cut with restriction endonucleases EcoRI and BglII and the DNA
fragment encoding vSrc kinase-Beta ITAM chain residues was isolated and ligated to EcoRI/BamHI
digested pBXG1 (Example 1D).

L. pMerSB, a GAL4 (1-147)-estrogen receptor l.b.d.-vSrc kinase-Beta ITAM chain
mammalian cell expression vector.

pMSB (Example 1K) was cut with restriction endonuclease EcoRI. An EcoRI DNA fragment
containing the human estrogen receptor l.b.d. coding sequences, prepared as described in 1C above,
was ligated to EcoRI digested pMSB. A recombinant clone was isolated that had estrogen receptor
sequences in the right orientation. This new plasmid, pMerSB, directs the expression of GAL4
DBD-estrogen receptor l.b.d.-vSrc kinase-Beta ITAM fusion protein in mammalian cells.

M. pMAS32, a Herpes Virus VP16 activation domain-human Src SH3/SH2 domain fusion
protein mammalian cell expression vector.

pMVN1 (Ivan Sadowski, Univ. B.C., Vancouver, CA) was used for the production of
TAD-human Src SH3/SH2 domain fusion proteins in mammalian cells. Sequences encoding the Src
SH3/SH2 domain (residues 84-249; see above) were amplified by PCR using the 5' and 3'
oligonucleotides, respectively:

(EcoRI end) cctcacGAATTCggtggagtgaccacctttgtggcc [SEQ ID NO:11] and (BamHI end)
ccactcGGATCCgccggggcacacggtggtgaggc [SEQ ID NO:12].

After restriction endonuclease digestion using EcoRI and BamHI, the PCR product was ligated
to EcoRI/BamHI digested pMVN1.

N. pMAS2, a Herpes Virus VP16 activation domain-human Src SH2 domain fusion protein
mammalian cell expression vector.

Sequences encoding the Src SH2 domain (residues 144-249) [Tanaka et al., supra] were
amplified by PCR using the 5' and 3' oligonucleotides, respectively:
(EcoRI end) cctcacGAATTCggcgactccatccaggctgaggag [SEQ ID NO:13] and (BamHI end)
ccactcGGATCCgccggggcacacggtggtgaggc [SEQ ID NO:14].

After digestion with EcoRI and BamHI, the PCR fragment was cloned into EcoRI/BamHI
digested pMVN1 (Example 1M).

O. pMerVP, a GAL4 DBD-estrogen receptor l.b.d.-VP16 TAD fusion protein mammalian
cell expression vector.

An EcoRI DNA fragment containing the human estrogen receptor l.b.d. coding sequences,
prepared as described in 1C above, was ligated to EcoRI digested pBXG1 (Example 1D). Clones
containing the estrogen receptor sequences in the right orientation (designated pMer) were
identified by restriction endonuclease analysis. VP16 TAD residues were amplified using template
plasmid pMVN1 (Example 1M) and the 5' and 3' oligonucleotides, respectively: (BamHI end)
^GGATCCagxx^tgggggacgagctc [SEQ ID NO:15] and (SpeI end) g^CTAGTcccaccgiactcgtcaat [SEQ ID
NO:16]. After digestion with BamHI and SpeI, the PCR fragment was cloned into BamHI/SpeI
digested pMer.

P. pMAF2, a Herpes Virus VP16 activation domain-human Fyn SH2 domain fusion protein
mammalian cell expression vector.

Sequences encoding the Fyn SH2 domain (residues 149-251) [Genbank M14333] were
amplified by PCR using the 5' and 3' oligonucleotides such that the Fyn SH2 domain coding
sequences contained EcoRI and BamHI ends. After digestion with EcoRI and BamHI, the PCR
fragment was cloned into EcoRI/BamHI digested pMVN1 (Example 1M).

EXAMPLE 2: STABLE MAMMALIAN CELL LINE 

A variety of cell lines were established by introducing selected plasmid DNA, described in
Example 1, into mammalian cells using lipofectin reagent (Gibco BRL) as suggested by the
manufacturer.

A selected host cell line is the mammalian HT1080 fibrosarcoma cell line [American Type
Culture Collection Accession No. CRL7951]. These cells are prepared by lipofection using a SEAP
reporter construct comprising

(a) a GAL4 responsive/dependent promoter (5 GAL4 binding sites linked to the core IL-2
promoter, the "G5-IL2" promoter), which is placed upstream of
(b) secreted alkaline phosphatase (SEAP) coding sequences; and

a plasmid containing the hygromycin gene.

This cell line G5-IL2 HT1080 was employed in a variety of the examples below. 

EXAMPLE 3: TWO-HYBRID MECHANISM-BASED IN VIVO ASSAY - ZETA
ITAM-TANDEM ZAP SH2-DEPENDENT TWO-HYBRID FORMATION IN MAMMALIAN
CELLS

In a prototypic example of an assay of this invention, a two hybrid interaction recapitulates a 
critical event that occurs during T-cell activation, namely the SH2 domain-dependent binding of 
the ZAP-70 tyrosine kinase to the TCR zeta ITAM chain of the T-cell receptor. 

The G5-IL2 HT1080 fibrosarcoma cell line of Example 2 is transfected with two effector
plasmids by lipofection:

1) pMSZ (Example 1D) directs the expression of a novel fusion protein that consists of
GAL4 DBD residues, the vSrc kinase domain and the human zITAM chain. This fusion protein should
bind to DNA and have tyrosine kinase activity. The clustering of these molecules, localized at the
G5-IL2 promoter, should help promote efficient zITAM phosphorylation.

2) pMAZ22 (Example 1E) drives the expression of the tandem SH2 domains of ZAP, fused
to the herpes virus VP16 TAD.

For use as controls and comparisons, HT1080 cells were also transfected with carrier DNA
only, or pMSZ only, or pMS (GAL4 DBD-vSrc kinase), or pMS and pMAZ22.

The transfected host cells are cultured in 6 well dishes. Forty-eight hours after
transfection, the amount of SEAP released into the media was determined using a fluorescence assay
[J. Berger et al., Gene, 66:1-10 (1988)]. The extent of two-hybrid formation (i.e., the effect on
transcription and the amount of ZAP SH2-zITAM interaction) is ascertained by measuring the
amount (i.e., the activity) of the SEAP reporter gene product released into the media.

The results of this assay are shown in FIG. 2. Cells transfected with carrier DNA do not
release SEAP into the media (FIG. 2, col. 1). GAL4 DBD-vSrc kinase-Zeta ITAM fusion protein,
when expressed in cells, is not a competent transcriptional activator (FIG. 2, col. 2). However,
when, in addition, VP16-tandem ZAP SH2 fusion protein is made in these cells, transcription is
induced due to two-hybrid formation and the cells secrete copious amounts of SEAP into the culture
media (FIG. 2, col. 3). Thus, the requirements for the induction of SEAP production in
HT1080 fibrosarcoma cells are consistent with the interaction of tyrosine phosphorylated Zeta ITAM
and the tandem ZAP SH2 domains. The induction of SEAP production requires that both of the fusion
proteins described above be expressed. The recognition and binding of tyrosine phosphorylated
zITAM by the ZAP SH2 domains effectively brings the VP16 TAD to the promoter, thereby inducing
transcription of the SEAP reporter gene.

Two-hybrid formation is Zeta ITAM chain-dependent (compare FIG. 2, cols. 3 and 5).
Furthermore, GAL4 DBD fusion proteins which lack the vSrc kinase domain (GAL4 DBD-Zeta
ITAM), or which lack the Zeta chain, i.e., pMS (GAL4 DBD-vSrc kinase), fail to induce SEAP
production when produced in cells alone or with pMAZ22, i.e., the VP16 TAD-tandem ZAP SH2
fusion protein (see FIG. 2, cols. 4 and 5). See also cols. 1 and 2 for controls. In addition to the
results shown in FIG. 2, cells that contain plasmids that encode GAL4 DBD and VP16 TAD-tandem
ZAP SH2, or GAL4 DBD-vSrc kinase-Zeta ITAM and VP16 TAD, fail to produce SEAP.
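
The control experiments above amount to a simple logical rule: the reporter is induced only when the GAL4 DBD fusion carries both the kinase domain and the ITAM, and the VP16 TAD fusion carries the SH2 domains. The sketch below restates that rule in code. It is an illustration only; the class and function names (`Bait`, `Prey`, `reporter_induced`) are assumptions introduced here and are not part of the disclosure.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Bait:
    """Hypothetical model of a GAL4 DBD fusion protein."""
    has_kinase: bool  # vSrc kinase domain present
    has_itam: bool    # Zeta ITAM present


@dataclass(frozen=True)
class Prey:
    """Hypothetical model of a VP16 TAD fusion protein."""
    has_sh2: bool     # tandem ZAP SH2 domains present


def reporter_induced(bait: Optional[Bait], prey: Optional[Prey]) -> bool:
    """True only if a phosphorylated ITAM can recruit an SH2-TAD fusion."""
    if bait is None or prey is None:
        return False
    # The ITAM is phosphorylated only when the kinase is part of the same fusion.
    itam_phosphorylated = bait.has_kinase and bait.has_itam
    return itam_phosphorylated and prey.has_sh2
```

For example, `reporter_induced(Bait(True, True), Prey(True))` corresponds to the pMSZ plus pMAZ22 transfection, while `Bait(True, False)` corresponds to pMS.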

Hence, in a heterologous cell line the tandem ZAP-SH2:zITAM interaction that normally
occurs in T-cells has been reproduced and a simple, sensitive means to monitor the extent to which
these two molecules interact has been presented by this invention.

Using SEAP production as an indicator of the interaction of these two molecules, it is possible
to identify compounds that can specifically inhibit ZAP SH2 domain-dependent interactions. When
the host cells are cultured, they are exposed to test compounds. The extent to which the test
compounds successfully interrupt the interaction between the ZAP SH2 and the zITAM indicates the
presence of an SH2 inhibitor.

With the assay set up as described, one potential drawback when screening for molecules that 
inhibit ZAP SH2-dependent interactions is that the constitutive expression of the interacting 
components of the two-hybrid screen would result in constitutive ZAP SH2-Zeta ITAM complex 
formation. Hence, this embodiment of an assay identifies only compounds that efficiently disrupt 
two-hybrid formation. 

EXAMPLE 4: TRANSCRIPTIONALLY INDUCED TWO-HYBRID FORMATION: AN 
ALTERNATIVE TWO-HYBRID ASSAY FOR MAMMALIAN CELLS SCREEN FOR 
COMPOUNDS THAT INHIBIT ZAP SH2 DOMAIN FUNCTION 

In another embodiment of the assay of this invention, cells are exposed to potential inhibitory
test compounds and then the interaction of proteins whose association is to be inhibited is induced.
This embodiment of the assay depends on use of plasmids for the production of both the Zeta ITAM
and tandem ZAP SH2 fusion proteins in the cells (as described in Example 2), which have inducible
transcriptional regulatory sequences, rather than constitutive (i.e., SV40 virus based)
transcriptional regulatory sequences.

An example of an inducible transcriptional regulatory sequence is the mouse mammary
tumor virus (MMTV) dexamethasone inducible promoter. Other known regulatory sequences are
useful here. Thus, if, for example, the sequences encoding GAL4 DBD-vSrc kinase-Zeta ITAM
fusion protein were linked to the MMTV inducible regulatory sequences, fusion protein production
would be dexamethasone-dependent. The reporter cell line is then exposed to potential inhibitory
compounds and two-hybrid formation is induced using dexamethasone. In this embodiment Zeta
ITAM-tandem ZAP SH2 two-hybrid formation occurs if SH2 domain(s) binding is not affected by
inhibitory compounds.

The mammalian G5-IL2 HT1080 fibrosarcoma cell line (Example 2) is transfected with two
effector plasmids by lipofection:

(1) One plasmid constitutively drives the expression of TAD-ZAP tandem SH2.

(2) The other plasmid encodes a fusion protein, GAL4 DBD-vSrc kinase domain-zITAM,
expressed using the dexamethasone (Dex) inducible MMTV promoter.

These transfected mammalian cells are seeded in 96 well dishes and serial dilutions of test
compounds are added to the wells. Dex is added to the media. Phosphorylated zITAM fusion protein
accumulates in cells upon the addition of Dex. After a 24 hour incubation, media is removed from
the 96 well dish and placed into a second 96 well dish to perform a chromogenic or fluorescence
assay for SEAP. The SEAP assay is measured using a 96 well plate reader. The amount of SEAP
released into the media can be quantitated by using either a chromogenic or fluorescence assay.
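
The plate-reader readout described above is typically reduced to a percent-inhibition value for each well. The following is a minimal sketch of that arithmetic, offered as an illustration rather than as the procedure actually used. The function names, the dilution factor chosen by the caller and the optional background term are all assumptions introduced here.

```python
def serial_dilutions(top_conc: float, factor: float, n_wells: int) -> list:
    """Concentrations for an n-step serial dilution starting at top_conc."""
    if factor <= 1 or n_wells < 1:
        raise ValueError("factor must exceed 1 and n_wells must be at least 1")
    return [top_conc / factor ** i for i in range(n_wells)]


def percent_inhibition(signal: float, uninhibited: float,
                       background: float = 0.0) -> float:
    """Percent reduction of the SEAP signal relative to an untreated well.

    signal      -- reading from a compound-treated, Dex-induced well
    uninhibited -- reading from a Dex-induced well without compound
    background  -- reading from an uninduced well (assumed 0 if not measured)
    """
    span = uninhibited - background
    if span <= 0:
        raise ValueError("uninhibited signal must exceed background")
    return 100.0 * (1.0 - (signal - background) / span)
```

A well reading half of the uninhibited signal gives 50 percent inhibition; a well reading at background gives 100 percent.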

By binding of the fusion protein to the ZAP SH2 domains, the TAD is brought to the SEAP
promoter and SEAP is produced. Compounds will be present in cells to bind to ZAP SH2 domains
before Dex-induced production of phosphorylated zITAM occurs. Compounds that block ZAP SH2
function reduce SEAP production. Compounds that reduce SEAP production are analyzed using cells
expressing SEAP in a ZAP SH2-independent manner to confirm that the compound is having a
specific effect on ZAP SH2:zITAM complex formation.

EXAMPLE 5: TWO-HYBRID ASSAY DETECTS BETA ITAM-SRC SH2
DOMAIN-DEPENDENT TWO-HYBRID FORMATION IN MAMMALIAN CELLS

A two-hybrid assay useful for detecting Src tyrosine kinase SH2 domain-dependent
interactions is performed as follows.

G5-IL2 HT1080 cells (Example 2) were transfected with the following plasmid DNAs:
(a) pMSB (Example 1K: GAL4 DBD-vSrc kinase-Beta ITAM);
(b) pMSB and pMAS2 (Example 1N: VP16-Src SH2); or
(c) pMSB and pMAZ22 (Example 1E: VP16-tandem ZAP SH2).

These transfected mammalian cells are seeded in 96 well dishes. Forty-eight hours after
transfection, the amount of SEAP released into the media was determined using a fluorescence assay.
The results are shown in FIG. 6. Col. 1 illustrates that pMSB (GAL4 DBD-vSrc kinase-IgE
receptor Beta ITAM fusion protein) is not by itself a competent transcriptional activator.
However, when pMAS2 (VP16-Src SH2 fusion protein) is also present in cells, transcription is
induced (see col. 2). In contrast, VP16-tandem ZAP SH2 does not efficiently interact with GAL4
DBD-vSrc kinase-Beta ITAM fusion protein to induce SEAP production (see col. 3).

When pMAS2 was co-expressed with GAL4 DBD-vSrc kinase-Zeta ITAM in the HT1080
fibrosarcoma indicator cell line, SEAP production was not observed (data not shown). If Beta ITAM
residues are replaced with the Zeta ITAM sequences, two-hybrid formation (SEAP production) is
not observed using these conditions.

Summarizing both Examples 4 and 5, two-hybrid formation appears to be ITAM and SH2
domain specific. The tandem ZAP SH2 domains bind to the Zeta ITAM (FIGS. 2 and 3), but not to the
Beta ITAM (FIG. 6, col. 3). The Src SH2 domain efficiently binds to the Beta ITAM (FIG. 6) but not
to Zeta ITAM residues. As observed when using the ZAP SH2 domains and Zeta ITAMs, Src SH2-
dependent two-hybrid formation can be rendered inducible via the use of the estrogen receptor
ligand binding domain residues, as discussed below.
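
The pairing specificity summarized above can be written down as a small lookup table. This is only a restatement of the outcomes reported in the text (SEAP induced or not induced); the dictionary layout and the function name `partners_for` are assumptions made for illustration.

```python
# Outcomes as reported in the text: True means SEAP production was induced.
OBSERVED = {
    ("Zeta ITAM", "tandem ZAP SH2"): True,
    ("Zeta ITAM", "Src SH2"): False,
    ("Beta ITAM", "Src SH2"): True,
    ("Beta ITAM", "tandem ZAP SH2"): False,
}


def partners_for(itam: str) -> list:
    """SH2 fusions reported to form a productive two-hybrid with this ITAM."""
    return sorted(sh2 for (name, sh2), induced in OBSERVED.items()
                  if name == itam and induced)
```

Each ITAM has exactly one productive partner in this table, which is the sense in which the interaction is described as ITAM and SH2 domain specific.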

EXAMPLE 6: POST-TRANSCRIPTIONALLY INDUCED TWO-HYBRID FORMATION: A
STEROID DEPENDENT TWO-HYBRID ASSAY FOR MAMMALIAN CELLS

A. Estrogen regulation of Zeta ITAM-tandem ZAP SH2 dependent two-hybrid formation in
mammalian cells (transient transfections)

The mammalian G5-IL2 HT1080 fibrosarcoma cell line (Example 2) is transfected with two
effector plasmids by lipofection:

1) pMerSZ (Example 1F) drives the production of GAL4 DBD-ER l.b.d.-vSrc kinase-Zeta
ITAM fusion protein in mammalian cells.

2) pMAZ22, VP16 TAD-tandem ZAP SH2 (Example 1E).

For controls and comparisons, the G5-IL2 HT1080 cells were also transfected with carrier
DNA, or with pMSZ only, or with pMSZ and pMAZ22, or with pMerSZ only.

These transfected mammalian cells are seeded in 6 well dishes and serial dilutions of
test compounds are added to the wells. Twenty-four hours after transfection, one duplicate dish of
cells was cultured in the presence of 10 nM estrogen.

Forty-eight hours after transfection, an aliquot of media from all of the dishes was removed
from the 6 well dish and placed into a 96 well dish to perform a chromogenic or fluorescence assay
for SEAP activity.

FIG. 3 provides the results of this assay: The cell line transfected with carrier DNA is
reported in cols. 1 and 2; or with pMSZ (Example 1D) in cols. 3 and 4; or with both pMSZ and
pMAZ22 in cols. 5 and 6. Cols. 7 and 8 are the results of the cell line transfected only with plasmid
pMerSZ. Cols. 9 and 10 are results of the cell line transfected with both pMerSZ and pMAZ22, as
described above.

The addition of the estrogen receptor l.b.d. residues to the GAL4 DBD-vSrc kinase-Zeta ITAM
fusion protein makes two-hybrid formation estrogen-dependent (compare FIG. 3, cols. 9 and 10).
That is, in the absence of exogenously added estrogen, the fusion protein (GAL4 DBD-ER l.b.d.-
vSrc kinase-zITAM) is not functional. In fact, FIG. 3, col. 9 shows that only low levels of SEAP
are detected in the media. This is presumably due to the low levels of estrogen present in serum.
Low levels of SEAP are not observed if the cells are cultured in a defined synthetic media lacking
estrogen. When plasmids pMerSZ and pMAZ22 are expressed together in cells, two-hybrid
formation (reporter gene transcription) should not occur (col. 9) unless estrogen is present in
the media (col. 10). When these transfected cells were exposed to 10 nM estrogen, copious
amounts of SEAP were released into the media.

Estrogen did not markedly affect two-hybrid formation if the estrogen receptor l.b.d. was not
present in either plasmid expressing fusion protein (cols. 5 and 6). Also, the estrogen receptor
l.b.d. residues do not make the GAL4 DBD-vSrc kinase-Zeta ITAM fusion protein itself a
transcriptional activator (FIG. 3, cols. 7 and 8).

SEAP induction was time-dependent; the majority of the SEAP was produced 12-18 hours
after exposure of the cells to estrogen (see FIG. 5 for analogous results with stably transfected
cells).

B. Estrogen regulation of Beta ITAM-Src SH2-dependent two-hybrid formation in
mammalian cells (transient transfection).

In a manner similar to that described above for the Zeta ITAM constructs, G5-IL2 HT1080
cells were transfected with the following plasmid DNA (in duplicate): pMerSB, or pMerSB and
pMAS2. Twenty-four hours after transfection, one dish of cells was cultured in the presence of 10
nM estrogen. Another twenty-four hours later, an aliquot of media from all of the dishes was
analyzed.

As shown in FIG. 7, the presence (+) or absence (-) of estrogen is indicated, and the columns
represent cells transfected with pMerSB (Example 1L) [cols. 1 and 2] or with pMerSB and pMAS2
(Example 1N) [cols. 3 and 4]. The addition of the estrogen receptor l.b.d. residues to the GAL4
DBD-vSrc kinase-Beta ITAM fusion protein makes two-hybrid formation estrogen-dependent
(compare cols. 3 and 4). In contrast, the estrogen receptor l.b.d. residues do not make the GAL4
DBD-vSrc kinase-Beta ITAM fusion protein itself a transcriptional activator (cols. 1 and 2).

C. Stable cell line that constitutively expresses GAL4 DBD-estrogen receptor l.b.d.-Src
kinase-zITAM and ZAP-SH2 activation domains in mammalian cells.

G5-IL2 HT1080 cells that contain an integrated GAL4-dependent SEAP reporter gene
(Example 2) were transfected with pBabeNeo (neomycin selection) and pMerVP (Example 1O), or
with pBabeNeo and the two effector plasmids that drive the constitutive production of GAL4
DBD-estrogen receptor l.b.d.-vSrc kinase-Zeta ITAM and VP16 TAD-tandem ZAP SH2, i.e.,
pMerSZ (Example 1F) and pMAZ22 (Example 1E), respectively.

After selection with both hygromycin and neomycin, cell lines were generated that contain the
reporter gene (hygromycin selection) and both effector plasmids (neomycin selection). Individual
colonies were isolated and expanded. One set of duplicate cells was exposed to 10 nM estrogen. Both
duplicate sets of cells were assayed for estrogen-induced (± inducer) SEAP activity. Those with
markedly induced SEAP production were expanded for further analysis.

Of the twelve colonies containing MerSZ and MAZ22 that were analyzed, five exhibited high
levels of SEAP activity after exposure to estrogen (FIG. 4). Two of these clones (clone A and clone
D) exhibited very high levels of estrogen-induced SEAP gene expression. The amount of SEAP in
the media was induced 200-fold (cell line #3) and 600-fold (cell line #6) after the addition of 10 nM
estrogen. Stable cell lines that contained estrogen-regulated MerVP were also obtained (Example
6D).
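
Fold induction, as quoted above, is the ratio of the SEAP signal with estrogen to the signal without it. The sketch below shows one way the clone selection step could be expressed. It is an illustration under stated assumptions: the function names are hypothetical, the cut-off for "markedly induced" is left to the caller because the text does not specify one, and any readings passed in are the user's own data.

```python
def fold_induction(induced: float, uninduced: float) -> float:
    """Ratio of SEAP signal with estrogen to SEAP signal without estrogen."""
    if uninduced <= 0:
        raise ValueError("uninduced signal must be positive")
    return induced / uninduced


def select_inducible(readings: dict, min_fold: float) -> list:
    """Names of clones whose fold induction meets a caller-chosen cut-off.

    readings maps a clone name to a (minus_estrogen, plus_estrogen) pair.
    """
    return sorted(name for name, (minus_e, plus_e) in readings.items()
                  if fold_induction(plus_e, minus_e) >= min_fold)
```

With made-up readings such as `{"x": (1.0, 2.0), "y": (2.0, 1200.0)}` and a cut-off of 100, only `"y"` (600-fold) would be retained.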

The time course of SEAP production in this stable cell line with inducible Zeta ITAM-tandem ZAP
SH2-dependent two-hybrid activity was studied as follows. Clone D (FIG. 4) was cultured in the
presence of 10 nM estrogen and, at various times after the addition of estrogen, culture media was
assayed for the presence of SEAP activity. As shown in FIG. 5, the bulk of SEAP production occurs
by 12 hours. Clone D cells do not produce appreciable amounts of SEAP unless estrogen is
exogenously added to the media (see FIG. 4). Also, the parent cells (G5-IL2 HT1080 cells) do not
produce SEAP when cultured in the presence of 10 nM estrogen for 24 hours (compare -E, +E
columns of FIG. 5).

D. Control Cell Line

When screening for compounds that block SH2-dependent two-hybrid formation (SEAP
production), compounds that have nonspecific effects will also be identified. Such compounds
include those that are cytotoxic, those that affect transcription, translation or secretion, and those that
affect GAL4 DNA binding or the estrogen receptor ligand binding domain. Hence, compounds that are
identified using the SH2 dependent assay are tested using a cell line that constitutively expresses
GAL4 DBD-estrogen receptor l.b.d.-VP16 fusion protein.

The results with one clone (cell line 6), containing an inducible GAL4-estrogen receptor
l.b.d.-VP16 transcriptional activator in the presence (+) or absence (-) of estrogen, are shown in
FIG. 8. Compounds that exert nonspecific effects as described above will score positive when using
this cell line.
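
The counter-screen logic described above can be summarized as a two-step decision: a compound is of interest only if it reduces SEAP production in the SH2-dependent assay but not in the control cell line. The following sketch is illustrative only. The function name, the category labels and the default 50 percent threshold are assumptions introduced here, not values taken from the disclosure.

```python
def classify_hit(inhib_sh2_assay: float, inhib_control_assay: float,
                 threshold: float = 50.0) -> str:
    """Triage a compound from its percent inhibition in the two assays.

    inhib_sh2_assay     -- percent inhibition in the SH2-dependent cell line
    inhib_control_assay -- percent inhibition in the GAL4 DBD-ER l.b.d.-VP16 line
    threshold           -- hypothetical cut-off for calling a well 'inhibited'
    """
    if inhib_sh2_assay < threshold:
        return "inactive"
    if inhib_control_assay >= threshold:
        # Also blocks SH2-independent reporter expression, e.g. cytotoxicity.
        return "nonspecific"
    return "candidate SH2 inhibitor"
```

A compound that inhibits both cell lines is flagged as nonspecific, matching the statement that such compounds score positive in the control line.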

EXAMPLE 7: METHODS FOR CHARACTERIZING BINDING ACTIVITY OF AN INHIBITOR 
IDENTIFIED BY THE METHOD OF THIS INVENTION 

Several methods for characterizing binding activity (e.g., counter-screens) of an inhibitor
identified by the present invention are discussed below, where the PBD is an SH2 domain of
interest.

1. Competitive Binding Assays:

Binding is measured by competition using surface plasmon resonance [Malmqvist, M., Curr.
Opin. Immunol., 5:282-286 (1993)] as implemented in the BIAcore® Biosensor (Pharmacia
Biosensor, Piscataway, NJ). SH2 proteins, e.g., pp60src, pp70ZAP, or pp72syk, are
pre-incubated with various concentrations of test compound and the ability of the test compound to
competitively inhibit binding to a phosphopeptide ligand is measured. Results are compared to binding
measured in the absence of competitor and expressed as percent inhibition.

IC50 values reflect the concentration of inhibitor required to reduce binding by 50%.
Specifics of individual assays are described below. All assays are run in HEPES Buffered Saline
(HBS) composed of 10 mM HEPES (pH 7.4) / 150 mM NaCl / 3.4 mM EDTA / 0.05% Surfactant
P20 at 25°C and a flow rate of 5 µL min-1.
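
An IC50 can be estimated from a set of percent-inhibition measurements in several ways. The sketch below uses log-linear interpolation between the two tested concentrations that bracket 50% inhibition. This is one common approximation chosen here for illustration, not necessarily the method used in the original work, and the function name is an assumption.

```python
import math
from typing import Optional, Sequence


def ic50_by_interpolation(concs: Sequence[float],
                          inhibitions: Sequence[float]) -> Optional[float]:
    """Estimate IC50 by interpolating on a log10 concentration scale.

    concs       -- inhibitor concentrations (all positive, same units)
    inhibitions -- percent inhibition measured at each concentration
    Returns None if no adjacent pair of points brackets 50% inhibition.
    """
    if len(concs) != len(inhibitions):
        raise ValueError("concs and inhibitions must have equal length")
    if any(c <= 0 for c in concs):
        raise ValueError("concentrations must be positive")
    pairs = sorted(zip(concs, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(pairs, pairs[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi > i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None
```

The returned value is in the same units as the input concentrations. A full curve fit (for example, a four-parameter logistic model) would normally be preferred when enough data points are available.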

a. Specifics of Tandem Syk Assay 

A pp72syk peptide ligand corresponding to the g-chain ITAM of human FceRI
[DGVY(PO4)TGLSTRNQETY(PO4)ETLK [SEQ ID NO:17]] was synthesized as part of a larger peptide
[Ac-CGGDGVY(PO4)TGLSTRNQETY-(PO4)ETLK-NH2 [SEQ ID NO:18]] and used to generate a
Syk-sensitive biosensor surface. Specifically, a Biosensor Chip CM5 was activated with 200 mM
ethyl-3-(3-dimethylaminopropyl)-carbodiimide hydrochloride (EDC) / 50 mM
n-hydroxysuccinimide (NHS) to generate a surface reactive to primary amines; treated with
ethylenediamine to generate a surface rich in primary amines; activated with
m-maleimidobenzoyl-n-hydroxysuccinimide ester (sulfo-MBS; 50 mM in 25 mM NaHCO3) to
generate a surface reactive to free thiols; and the ITAM peptide immobilized through the
n-terminal cysteine. Unreacted sites were blocked with β-mercaptoethanol and the chip cleaned of
noncovalently bound peptide using 6 M guanidine hydrochloride. Assays were run in HBS using 20
nM pp72syk(1-265) +/- test inhibitor.

b. Specifics of C-Syk Assay 

A pp72syk peptide ligand corresponding to a hemiphosphorylated g-chain ITAM of human
FceRI [DGVY(PO4)TGLSTRNQETYETLK [SEQ ID NO:19]] was synthesized as part of a larger peptide
[Ac-CGGDGVY(PO4)TGLSTRNQETYETLK-NH2 [SEQ ID NO:20]] and used to generate a
C-Syk-sensitive biosensor surface as described above for tandem Syk. Assays were run in HBS
using 270 nM pp72syk(163-265) +/- test inhibitor.

c. Specifics of Tandem ZAP Assay 

A pp70ZAP peptide ligand corresponding to the z-chain ITAM-1 of the human T-cell receptor
[NQLY(PO4)NELNIGRREEY(PO4)DVLD [SEQ ID NO:21]] was synthesized as a part of a larger
peptide [Ac-KGGNQLY(PO4)NELNIGRREEY-(PO4)DVLD-NH2 [SEQ ID NO:22]] and used to generate a
ZAP-sensitive biosensor surface. Specifically, a Biosensor Chip CM5 was activated with 200 mM
EDC / 50 mM NHS to generate a surface reactive to primary amines and the ITAM peptide
immobilized through the n-terminal lysine. Unreacted sites were blocked with ethanolamine (1 M
in water) and the chip cleaned of noncovalently bound peptide using 6 M guanidine hydrochloride.
Assays were run in HBS using 10 nM pp70ZAP(1-259) +/- test inhibitor.

d. Specifics of Src Assay

A pp60src peptide ligand corresponding to the hamster Middle-T antigen [QY(PO4)EEIPI [SEQ
ID NO:23]] was synthesized as a part of a larger peptide [Ac-KGGQY(PO4)EEIPI-NH2 [SEQ ID
NO:24]] and used to generate a src-sensitive biosensor surface as described above for tandem ZAP.
Assays were run in HBS using 270 nM pp60src(144-251) +/- test inhibitor.

EXAMPLE 8: YEAST-BASED TWO-HYBRID SCREENING ASSAYS 

The constitutive as well as inducible Src SH2- and ZAP SH2-dependent two-hybrid systems
described here using mammalian cells are also useful with yeast cells. A yeast strain which
contains an integrated reporter gene consisting of the β-galactosidase gene linked to a promoter
that is GAL4-dependent has been demonstrated. Two-hybrid formation induces β-galactosidase
production, which can be detected using an enzyme assay, or from the blue color produced when
cells producing enzyme are exposed to X-GAL. As with mammalian cells, SH2-dependent
two-hybrid formation in yeast can be rendered estrogen-dependent using the estrogen receptor
ligand binding domain.

A yeast strain such as the one that contains a mutation in the srb1 gene is used for the
two-hybrid assay. Cells containing the ts1 allele of srb1 exhibit increased endocytosis and have
generally increased permeability. Cells are engineered so that plasmids used for the two-hybrid
screen can be selectively propagated.

a) HIS3-1,2,4-triazole Assay

In this assay, cells auxotrophic for histidine are transformed with a HIS3 gene controlled by
a promoter that contains LexA DNA binding sites. The tandem SH2 domains of ZAP are expressed in
cells linked to the LexA DBD. This fusion protein binds to DNA, but is not a competent
transcriptional activator. The TCR ζ ITAM sequence is also expressed in the cells, as part of a
hybrid protein that includes a nuclear localization sequence, the Lck tyrosine kinase catalytic
domain and a transcriptional activation domain. The Lck kinase domain should be active and in the
nucleus, with the potential to phosphorylate the tyrosine residues of the ζ ITAM sequence either in cis
or in trans.

The interaction of the phosphorylated ζ ITAM-transcriptional activation domain fusion
protein with the ZAP SH2 domains-LexA protein activates HIS3 transcription, generating cells that
can grow in media lacking histidine. Compounds that interfere with the interaction of the ZAP SH2
domains with phosphorylated ζ ITAM will reduce the level of HIS3 transcription. Compounds that
efficiently block this interaction prevent yeast growth in media lacking histidine. However,
compounds that weakly disrupt ZAP SH2-ζ ITAM association are not detected, as histidine production
by the cell must be dramatically reduced to affect viability.

The sensitivity of this assay can be adapted to detect molecules that weakly block
ZAP SH2-ζ ITAM association. Cells are grown in the presence of 3-amino-1,2,4-triazole, a
competitive inhibitor of the HIS3 gene product. Cells are cultured using a concentration of
3-amino-1,2,4-triazole that is just below the amount required to affect cell viability. Compounds
that trigger even a small reduction of HIS3 transcription are identified, as the affected cells will no
longer grow.
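
Choosing the 3-amino-1,2,4-triazole concentration "just below the amount required to affect cell viability" amounts to reading a titration curve. The sketch below is one hedged way to make that choice from plate-reader data. The function name, the 90% growth cutoff and the example optical densities are illustrative assumptions and do not come from this disclosure.

```python
def highest_tolerated_dose(titration, min_relative_growth=0.9):
    """Pick the highest inhibitor dose that still permits near-normal growth.

    titration maps dose (e.g. mM 3-amino-1,2,4-triazole) to culture density
    (e.g. OD600) and must include an untreated control at dose 0.
    min_relative_growth is an assumed cutoff: growth relative to the control
    below this fraction is treated as "viability affected".
    """
    doses = sorted(titration)
    if not doses or doses[0] != 0:
        raise ValueError("titration must include an untreated (dose 0) control")
    baseline = titration[doses[0]]
    if baseline <= 0:
        raise ValueError("control culture shows no growth")

    best = doses[0]
    for dose in doses[1:]:
        if titration[dose] / baseline < min_relative_growth:
            break  # first dose that affects growth; stop just below it
        best = dose
    return best


if __name__ == "__main__":
    # Hypothetical titration data, for illustration only.
    example = {0: 1.00, 1: 0.98, 5: 0.95, 10: 0.91, 20: 0.60, 40: 0.10}
    print(highest_tolerated_dose(example))
```

With the hypothetical data above, the function returns 10, the last dose before growth falls below the assumed 90% cutoff. In practice the cutoff would be set empirically for the strain in use.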

There are two counterscreens useful to evaluate whether compounds that score in the above
assay exert their effect on growth by specifically affecting ZAP SH2-ζ ITAM association. In one
counterscreen, the cells are cultured in the presence of histidine. ZAP SH2 inhibitors do not affect
the growth of cells under this condition. In a second counterscreen, an identical yeast strain is
established except that the transcription of HIS3 is driven by a monomer of the LexA DBD
genetically fused to the activation domain; thus the transcription of the HIS3 gene is not dependent
on ZAP SH2-ζ ITAM association.

Specifically, the yeast host cells in log phase growth are seeded in 96 well dishes (0.1 ml of
cells) in media that lacks histidine and contains 3-amino-1,2,4-triazole. Serial dilutions of
compound are added to wells. The cultures are incubated 1-2 days at 30°C on a shaker platform.
The effect of compounds on growth is assessed using a 96 well plate reader.
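
The plate procedure above involves two small calculations: laying out a serial dilution of compound and expressing each well's reading as a percentage of untreated growth. The sketch below illustrates both. The function names, the two-fold dilution factor and the blank-subtraction step are assumptions made for illustration, since the text specifies neither the dilution factor nor how readings are normalized.

```python
def serial_dilution(top_concentration, fold, n_wells):
    """Concentrations for an n-well serial dilution, highest first."""
    if fold <= 1 or n_wells < 1:
        raise ValueError("fold must be > 1 and n_wells must be >= 1")
    return [top_concentration / fold ** i for i in range(n_wells)]


def percent_growth(od_treated, od_untreated, od_blank=0.0):
    """Growth in a treated well as a percentage of the untreated control.

    od_blank is the reading of a medium-only well (assumed correction).
    """
    control = od_untreated - od_blank
    if control <= 0:
        raise ValueError("untreated control shows no growth above blank")
    return 100.0 * (od_treated - od_blank) / control


if __name__ == "__main__":
    # Hypothetical example: 4 wells, two-fold dilution from 100 (e.g. uM).
    print(serial_dilution(100.0, 2, 4))   # [100.0, 50.0, 25.0, 12.5]
    print(percent_growth(0.5, 1.0))       # 50.0
```

A compound whose percent growth falls with increasing concentration in medium lacking histidine would then be carried forward to the counterscreens.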

This assay depends on growth inhibition (the interference of two-hybrid dependent HIS3
transcription) as the readout. Secondary assays should be performed to confirm that the
compounds that scored positive are not toxic to yeast.
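
The primary growth screen and the two counterscreens described in this section can be read together as a simple triage rule. The sketch below is an illustrative encoding of that logic only. The function name and the boolean inputs are hypothetical, and how "inhibits growth" is called from raw plate data is left to the user.

```python
def triage_compound(inhibits_primary, inhibits_with_histidine, inhibits_control_strain):
    """Classify a compound from three growth-inhibition observations.

    inhibits_primary:        growth reduced in medium lacking histidine
                             and containing 3-amino-1,2,4-triazole
    inhibits_with_histidine: growth reduced when histidine is supplied
                             (first counterscreen)
    inhibits_control_strain: growth reduced in the strain whose HIS3
                             transcription does not depend on ZAP SH2-ITAM
                             association (second counterscreen)
    """
    if not inhibits_primary:
        return "inactive"
    if inhibits_with_histidine:
        # Growth suffers even though HIS3 expression is not needed.
        return "nonspecific or toxic"
    if inhibits_control_strain:
        # Acts on HIS3 expression or function, not on the SH2-ITAM pair.
        return "not specific to ZAP SH2-ITAM association"
    return "candidate ZAP SH2 inhibitor"


if __name__ == "__main__":
    print(triage_compound(True, False, False))
    print(triage_compound(True, True, True))
```

Only compounds that score in the primary assay and in neither counterscreen are reported as candidates; everything else is set aside.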

b) URA3/5-FOA Assay:

The second approach for identifying molecules that inhibit ZAP SH2 binding in yeast scores
inhibitory compounds based on the ability to block ZAP two-hybrid dependent cell death. In this
assay, the tandem ZAP SH2 domain is expressed in yeast fused to the LexA DBD and the ζ ITAM-Lck
domain is fused to a transcriptional activation domain as in the constructions described above.
However, in this assay, the reporter gene, which is regulated by the LexA DBD, is the URA3 gene.

Cells which express the URA3 gene product die when exposed to 5-fluoro-orotic acid
(5-FOA). The product of the URA3 gene converts 5-FOA to fluorodeoxyuridine, which is a potent
inhibitor of thymidylate synthetase. Hence 5-FOA is conditionally toxic to cells. Compounds that
interfere with tandem ZAP SH2-ζ ITAM complex formation reduce the amount of URA3 gene
product. By carefully titrating the amount of two-hybrid transcriptional activator in the cell
and by using the lowest concentration of 5-FOA that will still kill cells, conditions are defined
where compounds that disrupt the ZAP SH2-ζ ITAM interaction are identified. Such molecules
reduce the URA3 gene product levels (and hence the fluorodeoxyuridine levels) below the level
where toxic amounts of fluorodeoxyuridine are formed.

In contrast to the assay of 8(a) above, toxic compounds are not scored when using the
URA3/5-FOA screen, which is a cell viability assay.
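
Because the URA3/5-FOA screen scores restored growth rather than lost growth, a hit is a well that grows better with compound than without it under 5-FOA selection. The sketch below shows one hedged way to express that readout. The function name and the two-fold rescue threshold are illustrative assumptions, not values taken from this disclosure.

```python
def scores_in_5foa_screen(od_with_compound, od_without_compound, min_fold_rescue=2.0):
    """Return True if a compound restores growth under 5-FOA selection.

    od_without_compound is the residual reading of cells killed by 5-FOA;
    it must be positive so that a fold change can be computed.
    min_fold_rescue is an assumed cutoff for calling growth "rescued".
    """
    if od_without_compound <= 0:
        raise ValueError("reference reading must be positive")
    return od_with_compound / od_without_compound >= min_fold_rescue


if __name__ == "__main__":
    # Hypothetical readings, for illustration only.
    print(scores_in_5foa_screen(0.60, 0.05))  # inhibitor of the interaction
    print(scores_in_5foa_screen(0.02, 0.05))  # generally toxic compound
```

A generally toxic compound cannot raise growth above the 5-FOA-treated reference, so it does not score, which is the property the paragraph above contrasts with the HIS3 assay.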

EXAMPLE 9: ASSOCIATION OF SRC PROTEIN WITH TYROSINE PHOSPHORYLATED PROTEINS

Several proteins have been reported to coprecipitate with v-Src from extracts of
v-Src-transformed cells [Kanner, S.B. et al., EMBO J. 10: 1689-1698 (1991); Koch, C.A., Mol.
Cell. Biol. 12: 1366-1374 (1992)]. The ability of a high affinity Src SH2-binding peptide
[EPQpYEEIPI, denoted here as YEEI [SEQ ID NO:25]] to block SH2 domain-dependent interactions is
demonstrated in this assay. SRD-3T3 cells (v-Src transformed 3T3 cells) were lysed in
RIPA buffer and the Src protein was immunoprecipitated with a monoclonal antibody to the
amino-terminus of Src in the presence or absence of the SH2 and SH3 binding peptides. The
immunoprecipitated proteins were transferred to nitrocellulose, and the blot was probed with an
antibody to phosphotyrosine. Two major tyrosine phosphorylated proteins co-migrated with v-Src from
SRD-3T3 cells, one of which co-migrated with the 62 kDa tyrosine-phosphorylated protein that binds
to the Src SH3 domain, and the other a protein of Mr 130,000, which is most likely the 130
kDa protein previously detected in v-Src immunoprecipitates [Koch, C.A., Mol. Cell. Biol.
12: 1366-1374 (1992)].

The binding of Src to the 62 kDa protein was reduced significantly by incubation of the
extract with the "Src-pro" peptide or the phosphorylated YEEI [SEQ ID NO:25] peptide, but not by the
poly-proline or the unphosphorylated YEEI [SEQ ID NO:25] peptides, suggesting that both the SH2
and SH3 domains of Src are required for stable association of this protein with Src. This protein
has not been determined to be the p62 protein identified in the total cell SH3 binding proteins, since
the p62 antibody is not sensitive enough to recognize the small amounts of 62 kDa protein that
associate with Src under these conditions. The "Src-pro" peptide and the pYEEI [SEQ ID NO:25]
peptide each block only 50% of p130 binding to Src, whereas 90% inhibition of p130 binding was
observed when both peptides were used, suggesting that either the SH2 or the SH3 domain is
sufficient for some degree of p130 association with Src.

The invention is not limited to the embodiments described herein.
Indeed, various modifications of the invention in addition to those described herein will be
apparent to those skilled in the art from the foregoing description. Such modifications are intended
to fall within the scope of the appended claims.

The disclosures of the patents, patent applications and publications cited herein are
incorporated by reference in their entireties.

SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: ARIAD PHARMACEUTICALS, INC.
RICKLES, Richard J. 

(ii) TITLE OF INVENTION: CELL-BASED ASSAY

(iii) NUMBER OF SEQUENCES: 25

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ARIAD Pharmaceuticals, Inc. 

(B) STREET: 26 Landsdowne Street 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02139-4234 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/012,218
(B) FILING DATE: 23-FEB-1996 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: BERSTEIN, David L.

(B) REGISTRATION NUMBER: 31,235 

(C) REFERENCE/DOCKET NUMBER: ARIAD 336-PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-494-0400 

(B) TELEFAX: 617-494-0208 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CGGAATTCTC CAAGCCCCAG ACCCA

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GCGGATCCCT CAGCGACCTC CAACA

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

CGGAATTCTC TGCTGGAGAC ATGAGAGCT

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

CGGAATTCGA CTGTGGCAGG GAAACCCT

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
GCGGATCCGG AGCTGGGGAA GAACTCA 27 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
ACGCGTCGAC TTATAAATCA ATGGGAGGAG 30 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
CCTCACGAAT TCGGTGGAGT GACCACCTTT GTGGCC 36 
(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

^ACTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 35 
(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

rrCACGAAT TCGGCGACTC CATCCAGGCT GAGGAG 36 
(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

lACTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 35 
(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

rrCACGAAT TCGGTGGAGT GACCACCTTT GTGGCC 36

(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
CCACTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 35 
(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CCTCACGAAT TCGGCGACTC CATCCAGGCT GAGGAG 36
(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
CCACTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 35 
(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:
aGGATCCAG CCTGGGGGAC GAGCTC 26 
(2) INFORMATION FOR SEQ ID NO:16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
G ACT AGTCC CACCGTACTC GTCAAT 26 
(2) INFORMATION FOR SEQ ID NO:17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site
(B) LOCATION: 4

(D) OTHER INFORMATION: /note= "phosphorylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 15

(D) OTHER INFORMATION: /note= "phosphorylated"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu Thr Tyr Glu
1 5 10 15

Thr Leu Lys

(2) INFORMATION FOR SEQ ID NO:18:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site
(B) LOCATION: 1

(D) OTHER INFORMATION: /note= "N-acetylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 7

(D) OTHER INFORMATION: /note= "phosphorylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 18

(D) OTHER INFORMATION: /note= "phosphorylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 22

(D) OTHER INFORMATION: /note= "amidated"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

Cys Gly Gly Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu
1 5 10 15

Thr Tyr Glu Thr Leu Lys 
20 

(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 4

(D) OTHER INFORMATION: /note= "phosphorylated"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:

Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu Thr Tyr Glu
1 5 10 15

Thr Leu Lys

(2) INFORMATION FOR SEQ ID NO:20:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 1

(D) OTHER INFORMATION: /note= "acetylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 7

(D) OTHER INFORMATION: /note= "phosphorylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 22

(D) OTHER INFORMATION: /note= "amidated"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:

Cys Gly Gly Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu
1 5 10 15 

Thr Tyr Glu Thr Leu Lys 
20 

(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "phosphorylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 15

(D) OTHER INFORMATION: /note= "phosphorylated"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

Asn Gln Leu Tyr Asn Glu Leu Asn Ile Gly Arg Arg Glu Glu Tyr Asp
1 5 10 15 

Val Leu Asp 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "N-acetylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 7

(D) OTHER INFORMATION: /note= "phosphorylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 18

(D) OTHER INFORMATION: /note= "phosphorylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 22

(D) OTHER INFORMATION: /note= "amidated"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:

Lys Gly Gly Asn Gln Leu Tyr Asn Glu Leu Asn Ile Gly Arg Arg Glu
1 5 10 15

Glu Tyr Asp Val Leu Asp
20 

(2) INFORMATION FOR SEQ ID NO:23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Gln Tyr Glu Glu Ile Pro Ile
1 5 

(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site
(B) LOCATION: 1

(D) OTHER INFORMATION: /note= "N-acetylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(D) OTHER INFORMATION: /note= "phosphorylated"

(ix) FEATURE:

(A) NAME/KEY: Modified-site
(B) LOCATION: 10

(D) OTHER INFORMATION: /note= "amidated"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

Lys Gly Gly Gln Tyr Glu Glu Ile Pro Ile
1 5 10 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

Tyr Glu Glu Ile
1

CLAIMS

1. A cell containing and capable of expressing recombinant DNA molecules encoding two fusion
proteins:

(a) a first fusion protein comprising one or more copies of each of a transcription
activation domain and one member of a phosphopeptide binding pair,

(b) a second fusion protein comprising one or more copies of each of a DNA-binding domain
and the other member of the phosphopeptide binding pair,

wherein the phosphopeptide binding pair comprises (i) a peptide ligand containing one or
more tyrosine residues and (ii) a phosphopeptide binding domain capable of binding to the peptide
ligand upon phosphorylation of one or more of the peptide ligand's tyrosine residues, and

wherein the cell further contains a reporter gene which is linked to a DNA sequence to which
said second fusion protein is capable of binding and which reporter gene encodes a detectable gene
product which is expressed when said transgene is activated by proximity to a transcriptional
activation domain.

2. The cell according to claim 1 wherein said phosphopeptide binding pair comprises an SH2
domain and a ligand therefor, a phosphatase domain and a ligand therefor, or a PTB phosphopeptide
binding domain and a ligand therefor.

3. The cell according to claim 1 wherein the recombinant DNA molecules encode

(a) a first fusion protein which comprises one or more copies of a transcription activation
domain and a phosphopeptide binding domain, and

(b) a second fusion protein which comprises one or more copies of a DNA-binding domain
and the peptide ligand capable, upon phosphorylation, of binding to the phosphopeptide binding
domain.

4. The cell according to claim 3 wherein the second fusion protein further comprises a kinase
domain.

5. The cell according to claim 3 wherein the first fusion protein contains two phosphopeptide
binding domains.

6. The cell according to claim 1 wherein one of the fusion proteins further contains a hormone
binding domain permitting regulated sequestering of the fusion protein.

7. The cell according to claim 1, which is mammalian.

8. The cell according to claim 1, further comprising an endogenous kinase domain.

9. A method for identifying the presence in a test composition of an inhibitor of the binding of a
phosphopeptide binding pair which comprises the steps of:

(a) providing cells of claim 4 or 8, which when cultured are capable of producing
measurable amounts of said detectable gene product,

(b) culturing said cells in the presence and absence of the test composition, and

(c) determining whether production of the detectable gene product was diminished in the
presence of the test composition.

10. The method according to claim 9 wherein said cell is a mammalian cell.

11. The method according to claim 9, wherein the DNA-binding domain and transcriptional
activation domain are derived from transcriptional activators having separable DNA-binding and
transcriptional activation domains.

12. The method according to claim 11, wherein the DNA binding domain and the transcriptional
activation domain are selected from the group consisting of transcriptional activators GAL4, GCN4
and ADR1.

13. The method according to claim 9, wherein the DNA-binding domain and the transcriptional
activation domain are from different transcriptional activators.

14. The method according to claim 9 wherein said phosphopeptide binding pair comprises an SH2
domain and a ligand therefor, a phosphatase domain and a ligand therefor, or a PID phosphopeptide
binding domain and a ligand therefor.

15. The method according to claim 9 wherein the first fusion protein contains two phosphopeptide
binding domains.

16. The method according to claim 9 wherein one of the fusion proteins further contains a
hormone binding domain permitting regulated sequestering of the fusion protein.

17. The method according to claim 9 wherein said test composition is selected from a mixture of
one or more test peptides, wherein said mixture is provided in the form of a library of synthetic
peptides or in the form of a phage library displaying the various peptides.

18. The method according to claim 9 wherein said polypeptide ligand is an immunoreceptor
tyrosine activation motif.

19. The method according to claim 18 wherein said ITAM is the TCR zeta ITAM.

20. An inhibitor identified by the method of claim 9.